OM nucleic - nucleic search, using sw model

Run on:          September 28, 2004, 20:34:22 ; Search time 6775 Seconds
                 (without alignments)
                 7682.624 Million cell updates/sec

Title:           US-10-069-541-5
Perfect score:   1743
Sequence:        1 atggctttccatgtggaagg...ctgaagataatttacagtga 1743

Scoring table:   IDENTITY_NUC
                 Gapop 10.0 , Gapext 1.0

Searched:        27513289 seqs, 14931090276 residues

Total number of hits satisfying chosen parameters:  55026578

Minimum DB seq length: 0
Maximum DB seq length: 2000000000

Post-processing: Minimum Match 0%
                 Maximum Match 100%
                 Listing first 45 summaries
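The header figures can be cross-checked against each other. The sketch below is a plausibility check, not the search tool's own accounting: it assumes one dynamic-programming cell per (query residue, database residue) pair and that both strands of every database sequence were searched. The two-strand assumption is not stated in the report, but it is consistent with the hit count being exactly twice the number of sequences.

```python
# Figures copied from the report header.
QUERY_LENGTH = 1743            # query length (equals the perfect score)
DB_SEQUENCES = 27_513_289      # "Searched: ... seqs"
DB_RESIDUES = 14_931_090_276   # "Searched: ... residues"
SEARCH_SECONDS = 6775          # "Search time"
STRANDS = 2                    # ASSUMPTION: forward and complement strands


def million_cell_updates_per_sec(query_len, db_residues, seconds, strands=STRANDS):
    """Throughput in millions of DP cells per second."""
    cells = query_len * db_residues * strands
    return cells / seconds / 1e6


rate = million_cell_updates_per_sec(QUERY_LENGTH, DB_RESIDUES, SEARCH_SECONDS)
hits = DB_SEQUENCES * STRANDS

print(f"{rate:.3f} Million cell updates/sec")  # prints 7682.624 Million cell updates/sec
print(hits)                                    # prints 55026578
```

Both values agree with the printed header, which suggests the residue count, search time, and throughput were transcribed correctly.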



Database :       EST:*
                  1:  em_estba:*
                  2:  em_esthum:*
                  3:  em_estin:*
                  4:  em_estmu:*
                  5:  em_estov:*
                  6:  em_estpl:*
                  7:  em_estro:*
                  8:  em_htc:*
                  9:  gb_est1:*
                 10:  gb_est2:*
                 11:  gb_htc:*
                 12:  gb_est3:*
                 13:  gb_est4:*
                 14:  gb_est5:*
                 15:  em_estfun:*
                 16:  em_estom:*
                 17:  em_gss_hum:*
                 18:  em_gss_inv:*
                 19:  em_gss_pln:*
                 20:  em_gss_vrt:*
                 21:  em_gss_fun:*
                 22:  em_gss_mam:*
                 23:  em_gss_mus:*
                 24:  em_gss_pro:*
                 25:  em_gss_rod:*
                 26:  em_gss_phg:*
                 27:  em_gss_vrl:*
                 28:  gb_gss1:*
                 29:  gb_gss2:*
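The report header names a Smith-Waterman ("sw") model with Gapop 10.0 and Gapext 1.0. The sketch below illustrates that kind of scoring using the standard affine-gap (Gotoh) recurrence. It is not the search tool's implementation. The function name, the match and mismatch values, and the convention that a gap of length k costs gap_open + (k - 1) * gap_ext are all assumptions made for illustration; the report does not print the contents of the IDENTITY_NUC table.

```python
def sw_affine_score(query, target, match=1.0, mismatch=-0.6,
                    gap_open=10.0, gap_ext=1.0):
    """Best local alignment score with affine gaps (score only, no traceback).

    ASSUMPTIONS: match/mismatch values are illustrative; the first gap
    position costs gap_open and each further position costs gap_ext.
    """
    neg_inf = float("-inf")
    n = len(target)
    h_prev = [0.0] * (n + 1)        # H[i-1][*]: best score ending at a cell
    f_prev = [neg_inf] * (n + 1)    # F[i-1][*]: best score ending in a gap in target
    best = 0.0
    for i in range(1, len(query) + 1):
        h_cur = [0.0] * (n + 1)
        f_cur = [neg_inf] * (n + 1)
        e = neg_inf                 # E[i][j]: best score ending in a gap in query
        for j in range(1, n + 1):
            e = max(h_cur[j - 1] - gap_open, e - gap_ext)
            f_cur[j] = max(h_prev[j] - gap_open, f_prev[j] - gap_ext)
            pair = match if query[i - 1] == target[j - 1] else mismatch
            h_cur[j] = max(0.0, h_prev[j - 1] + pair, e, f_cur[j])
            best = max(best, h_cur[j])
        h_prev, f_prev = h_cur, f_cur
    return best


print(sw_affine_score("ACGT", "ACGT"))
print(sw_affine_score("ACGTACGT", "ACGTTCGT"))
```

With a gap-open penalty of 10 against a match score of 1, a gap only pays off when it rescues a long run of matches, which fits the "Indels 0; Gaps 0" lines reported for the top hits.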



Pred. No. is the number of results predicted by chance to have a
score greater than or equal to the score of the result being printed,
and is derived by analysis of the total score distribution.

                     SUMMARIES

                     %
Result           Query
   No.    Score  Match  Length  DB  ID          Description
     1     1473   84.5    1743  29  AY413298    AY413298 Homo sapi
     2     1463   83.9    1743  29  AY413299    AY413299 Pan trogl
     3     1375   78.9    4097  11  AK053063    AK053063 Mus muscu
     4     1375   78.9    4306  11  AK034415    AK034415 Mus muscu
     5   1156.2   66.3    1743  29  AY413300    AY413300 Mus muscu
     6    518.8   29.8     672  29  AG157499    AG157499 Pan trogl
     7    472.2   27.1     707  14  CD350164    CD350164 UI-M-FYO-
     8    462.8   26.6     669  13  BY727598    BY727598 BY727598
     9      404   23.2     516  10  BE233479    BE233479 139685 MA
    10    329.8   18.9     650  10  BB626260    BB626260 BB626260
    11    312.8   17.9     541  10  AW668962    AW668962 111664 MA
    12      290   16.6     675  13  BY729567    BY729567 BY729567
    13    274.2   15.7     524  10  BE723927    BE723927 198406 MA
    14    263.2   15.1     800   9  AL669749    AL669749 AL669749
    15    225.8   13.0     549  13  BW274870    BW274870 BW274870
    16    212.4   12.2    1037   9  AL666817    AL666817 AL666817
    17      210   12.0     941  14  CD360297    CD360297 AGENCOURT
    18    209.2   12.0     641  12  BI630566    BI630566 RH59836.5
    19    207.4   11.9     640  12  BI629504    BI629504 RH58381.5
    20    205.2   11.8     658  12  BM629925    BM629925 170006875
    21    203.2   11.7     652  10  BB626456    BB626456 BB626456
    22    193.6   11.1     583  13  BW277281    BW277281 BW277281
    23    192.6   11.0     605  13  BQ829470    BQ829470 LL6in2176
    24    186.4   10.7     624  12  BJ122485    BJ122485 BJ122485
    25      183   10.5     681  14  CD306544    CD306544 StrPu691.
    26    178.6   10.2     565  12  BJ125564    BJ125564 BJ125564
    27    177.4   10.2     576  14  CB391304    CB391304 OSTF149A8
    28    176.6   10.1     310   9  AL918603    AL918603 AL918603
    29      166    9.5     604   9  AU199794    AU199794 AU199794
    30    163.4    9.4     500   9  AV994375    AV994375 AV994375
    31    163.2    9.4     555  12  BJ117801    BJ117801 BJ117801
    32    158.4    9.1     646   9  AB078155    AB078155 AB078155
c   33    157.4    9.0     801  13  BW002036    BW002036 BW002036
c   34      155    8.9     561  28  AQ316435    AQ316435 RPCIll-10
    35      153    8.8     584  12  BJ105382    BJ105382 BJ105382
    36    145.2    8.3     500  12  BP186503    BP186503 BP186503
c   37      134    7.7     632  28  AZ612750    AZ612750 1M0439J17
    38    128.2    7.4     500  12  BJ105730    BJ105730 BJ105730
c   39    127.6    7.3     618  28  AZ908709    AZ908709 RPCI-24-2
    40      121    6.9     926  29  CNS04L3J    AL295624 Tetraodon
c   41    117.2    6.7     525  12  BI508286    BI508286 BB170004A
c   42    117.2    6.7     530  12  BI503332    BI503332 BB170012A
    43    112.8    6.5     355   9  AU209671    AU209671 AU209671
c   44    112.4    6.4     558  12  BI507950    BI507950 BB170010A
c   45      108    6.2     420  12  BI506529    BI506529 BB170027B
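The "% Query Match" column in the summaries appears to be the hit score expressed as a percentage of the perfect score (1743 for this query). The sketch below recomputes it for a few rows. The function name is hypothetical, and rounding to one decimal place is an assumption that happens to reproduce the printed values.

```python
PERFECT_SCORE = 1743  # from the report header


def query_match_percent(score, perfect_score=PERFECT_SCORE):
    """Score as a percentage of the perfect score, to one decimal place."""
    return round(100.0 * score / perfect_score, 1)


# (result number, score, printed % Query Match) taken from the summaries
rows = [(1, 1473, 84.5), (2, 1463, 83.9), (3, 1375, 78.9),
        (5, 1156.2, 66.3), (6, 518.8, 29.8), (45, 108, 6.2)]

for number, score, printed in rows:
    print(number, query_match_percent(score), printed)
```

Recomputing the column this way is a cheap check on OCR damage in the score and percentage fields: a row whose recomputed value disagrees with the printed one has probably been misread.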



ALIGNMENTS 



RESULT 1
AY413298
LOCUS       AY413298                1743 bp    DNA     linear   GSS 12-DEC-2003
DEFINITION  Homo sapiens HCM4844 gene, VIRTUAL TRANSCRIPT, partial sequence,
            genomic survey sequence.
ACCESSION   AY413298
VERSION     AY413298.1  GI:39769260
KEYWORDS    GSS.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1  (bases 1 to 1743)
  AUTHORS   Clark,A.G., Glanowski,S., Nielson,R., Thomas,P., Kejariwal,A.,
            Todd,M.A., Tanenbaum,D.M., Civello,D.R., Lu,F., Murphy,B.,
            Ferriera,S., Wang,G., Zheng,X.H., White,T.J., Sninsky,J.J.,
            Adams,M.D. and Cargill,M.
  TITLE     Inferring nonneutral evolution from human-chimp-mouse orthologous
            gene trios
  JOURNAL   Science 302 (5652), 1960-1963 (2003)
   PUBMED   14671302
REFERENCE   2  (bases 1 to 1743)
  AUTHORS   Clark,A.G., Glanowski,S., Nielson,R., Thomas,P., Kejariwal,A.,
            Todd,M.A., Tanenbaum,D.M., Civello,D.R., Lu,F., Murphy,B.,
            Ferriera,S., Wang,G., Zheng,X.H., White,T.J., Sninsky,J.J.,
            Adams,M.D. and Cargill,M.
  TITLE     Direct Submission
  JOURNAL   Submitted (16-NOV-2003) Celera Genomics, 45 West Gude Drive,
            Rockville, MD 20850, USA
COMMENT     This sequence was made by sequencing genomic exons and ordering
            them based on alignment.
FEATURES             Location/Qualifiers
     source          1..1743
                     /organism="Homo sapiens"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:9606"
     gene            <1..>1743
                     /locus_tag="HCM4844"
ORIGIN



  Query Match             84.5%;  Score 1473;  DB 29;  Length 1743;
  Best Local Similarity   84.5%;  Pred. No. 0;
  Matches 1473;  Conservative    0;  Mismatches  270;  Indels    0;  Gaps    0;



Qy          1 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60

Qy         61 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 120

Qy        121 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGNN 180

Qy        181 ACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTAT 240

Db        181 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 240

Qy        241 GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 300

Db        241 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 300

Qy        301 TTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAA 360

Db        301 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 360

Qy        361 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTC 420

Db        361 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 420

Qy        421 TGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGAT 480
                                          ||||||||||||||||||||||||||||||||
Db        421 NNNNNNNNNNNNNNNNNNNNNNNNNNNNGAGCCACCATCAGCGTGATCATCGATGTGGAT 480

Qy        481 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG 540
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        481 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG 540

Qy        541 CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGG 600
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        541 CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGG 600

Qy        601 ATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTG 660
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        601 ATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTG 660

Qy        661 CATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGG 720
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        661 CATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGG 720

Qy        721 CTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGG 780
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        721 CTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGG 780

Qy        781 GTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGG 840
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        781 GTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGG 840

Qy        841 TGCCTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGG 900
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        841 TGCCTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGG 900

Qy        901 AACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTA 960
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        901 AACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTA 960

Qy        961 CCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTT 1020
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        961 CCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTT 1020

Qy       1021 TCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCA 1080
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1021 TCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCA 1080

Qy       1081 CGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTT 1140
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1081 CGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTT 1140

Qy       1141 ATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAA 1200
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1141 ATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAA 1200

Qy       1201 ACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAG 1260
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1201 ACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAG 1260

Qy       1261 CTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTT 1320
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1261 CTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTT 1320

Qy       1321 TCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATC 1380
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1321 TCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATC 1380

Qy       1381 TTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAATTTCCATTTAAA 1440
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1381 TTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAATTTCCATTTAAA 1440

Qy       1441 ACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTAT 1500
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1441 ACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTAT 1500

Qy       1501 CTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGA 1560
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1501 CTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGA 1560

Qy       1561 CACAGTGAAGAAAACATGGATAAGACAATTCTTGTCAAAAATGAAAATATTAAATTAGAT 1620
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1561 CACAGTGAAGAAAACATGGATAAGACAATTCTTGTCAAAAATGAAAATATTAAATTAGAT 1620

Qy       1621 GAACTTGCACTTGTGAAGCCACGACAGAGCATGACCCTCAGCTCAACTTTCACCAATAAA 1680
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1621 GAACTTGCACTTGTGAAGCCACGACAGAGCATGACCCTCAGCTCAACTTTCACCAATAAA 1680

Qy       1681 GAGGCCTTCCTTGATGTTGATTCCAGTCCAGAAGGGTCTGGGACTGAAGATAATTTACAG 1740
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1681 GAGGCCTTCCTTGATGTTGATTCCAGTCCAGAAGGGTCTGGGACTGAAGATAATTTACAG 1740

Qy       1741 TGA 1743
              |||
Db       1741 TGA 1743
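The match line between each Qy/Db pair, and the Matches/Mismatches counts in the statistics block, can be regenerated from the two aligned rows. The sketch below does this for an ungapped alignment, such as the ones in this report where "Indels 0; Gaps 0" is printed. The helper names are hypothetical, and treating N as never matching is an assumption that fits RESULT 1, where the 270 reported mismatches equal the number of N positions in the database sequence.

```python
def match_line(qy, db):
    """Return '|' under identical positions and ' ' elsewhere (no gaps)."""
    if len(qy) != len(db):
        raise ValueError("ungapped rows must have equal length")
    # ASSUMPTION: an ambiguous base (N) is never counted as a match.
    return "".join("|" if a == b and a != "N" else " " for a, b in zip(qy, db))


def identity_stats(qy, db):
    """Return (matches, mismatches, percent identity to one decimal)."""
    line = match_line(qy, db)
    matches = line.count("|")
    return matches, len(line) - matches, round(100.0 * matches / len(line), 1)


# Rows 121..180 of RESULT 1, copied from the alignment.
qy = "GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT"
db = "GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGNN"

print(qy)
print(match_line(qy, db))
print(db)
print(identity_stats(qy, db))
```

Summing the per-block match counts over a whole alignment and comparing the total with the printed "Matches" value is a practical way to detect bases that were misread during extraction.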



RESULT 2
AY413299
LOCUS       AY413299                1743 bp    DNA     linear   GSS 12-DEC-2003
DEFINITION  Pan troglodytes HCM4844 gene, VIRTUAL TRANSCRIPT, partial sequence,
            genomic survey sequence.
ACCESSION   AY413299
VERSION     AY413299.1  GI:39769261
KEYWORDS    GSS.
SOURCE      Pan troglodytes (chimpanzee)
  ORGANISM  Pan troglodytes
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Pan.
REFERENCE   1  (bases 1 to 1743)
  AUTHORS   Clark,A.G., Glanowski,S., Nielson,R., Thomas,P., Kejariwal,A.,
            Todd,M.A., Tanenbaum,D.M., Civello,D.R., Lu,F., Murphy,B.,
            Ferriera,S., Wang,G., Zheng,X.H., White,T.J., Sninsky,J.J.,
            Adams,M.D. and Cargill,M.
  TITLE     Inferring nonneutral evolution from human-chimp-mouse orthologous
            gene trios
  JOURNAL   Science 302 (5652), 1960-1963 (2003)
   PUBMED   14671302
REFERENCE   2  (bases 1 to 1743)
  AUTHORS   Clark,A.G., Glanowski,S., Nielson,R., Thomas,P., Kejariwal,A.,
            Todd,M.A., Tanenbaum,D.M., Civello,D.R., Lu,F., Murphy,B.,
            Ferriera,S., Wang,G., Zheng,X.H., White,T.J., Sninsky,J.J.,
            Adams,M.D. and Cargill,M.
  TITLE     Direct Submission
  JOURNAL   Submitted (16-NOV-2003) Celera Genomics, 45 West Gude Drive,
            Rockville, MD 20850, USA
COMMENT     This sequence was made by sequencing genomic exons and ordering
            them based on alignment.
FEATURES             Location/Qualifiers
     source          1..1743
                     /organism="Pan troglodytes"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:9598"
     gene            <1..>1743
                     /locus_tag="HCM4844"
ORIGIN



  Query Match             83.9%;  Score 1463;  DB 29;  Length 1743;
  Best Local Similarity   84.1%;  Pred. No. 0;
  Matches 1466;  Conservative    0;  Mismatches  277;  Indels    0;  Gaps    0;



Qy          1 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db          1 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60

Qy         61 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         61 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 120

Qy        121 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 180
              |||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        121 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACANNN 180

Qy        181 ACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTAT 240

Db        181 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 240

Qy        241 GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 300

Db        241 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 300

Qy        301 TTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAA 360

Db        301 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 360

Qy        361 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTC 420

Db        361 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 420

Qy        421 TGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGAT 480
                                          ||||||||||||||||||||||||||||||||
Db        421 NNNNNNNNNNNNNNNNNNNNNNNNNNNNGAGCCACCATCAGCGTGATCATCGATGTGGAT 480

Qy        481 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG 540
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        481 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG 540

Qy        541 CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGG 600
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        541 CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGG 600

Qy        601 ATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTG 660
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        601 ATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTG 660

Qy        661 CATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGG 720
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        661 CATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGG 720

Qy        721 CTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGG 780
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        721 CTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGG 780

Qy        781 GTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGG 840
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        781 GTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGG 840

Qy        841 TGCCTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGG 900
              |||||||||||||||||||||||||||||||| |||||||||||||||||||||||||||
Db        841 TGCCTGGTGATGGCCATCCCAGCCATACTCATCGGGGCCATTGGAGCATCAACAGACTGG 900

Qy        901 AACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTA 960
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        901 AACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTA 960

Qy        961 CCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTT 1020
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        961 CCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTT 1020

Qy       1021 TCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCA 1080
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1021 TCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCA 1080

Qy       1081 CGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTT 1140
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1081 CGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTT 1140

Qy       1141 ATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAA 1200
              |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1141 ATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAG 1200

Qy       1201 ACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAG 1260
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1201 ACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAG 1260

Qy       1261 CTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTT 1320
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1261 CTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTT 1320

Qy       1321 TCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATC 1380
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1321 TCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATC 1380

Qy       1381 TTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAATTTCCATTTAAA 1440
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1381 TTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAATTTCCATTTAAA 1440

Qy       1441 ACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTAT 1500
              ||||||||||||||||| ||||||||||||||||||||| |||||||||||||||| |||
Db       1441 ACACTTGCCATGGTTACGTCATTCTTAACCAACATTTGCGTCTCCTATCTAGCCAAATAT 1500

Qy       1501 CTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGA 1560
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1501 CTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGA 1560

Qy       1561 CACAGTGAAGAAAACATGGATAAGACAATTCTTGTCAAAAATGAAAATATTAAATTAGAT 1620
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1561 CACAGTGAAGAAAACATGGATAAGACAATTCTTGTCAAAAATGAAAATATTAAATTAGAT 1620

Qy       1621 GAACTTGCACTTGTGAAGCCACGACAGAGCATGACCCTCAGCTCAACTTTCACCAATAAA 1680
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1621 GAACTTGCACTTGTGAAGCCACGACAGAGCATGACCCTCAGCTCAACTTTCACCAATAAA 1680

Qy       1681 GAGGCCTTCCTTGATGTTGATTCCAGTCCAGAAGGGTCTGGGACTGAAGATAATTTACAG 1740
              ||||||||||||||||||||||||||||||||| ||||||||||||||||||||||||||
Db       1681 GAGGCCTTCCTTGATGTTGATTCCAGTCCAGAANGGTCTGGGACTGAAGATAATTTACAG 1740

Qy       1741 TGA 1743
              |||
Db       1741 TGA 1743



RESULT 3
AK053063
LOCUS       AK053063                4097 bp    mRNA    linear   HTC 20-SEP-2003
DEFINITION  Mus musculus 15 days embryo head cDNA, RIKEN full-length enriched
            library, clone:D930038E20 product:solute carrier family 5 (choline
            transporter), member 7, full insert sequence.
ACCESSION   AK053063
FEATURES             Location/Qualifiers
     source          1..4097
                     /organism="Mus musculus"
                     /mol_type="mRNA"
                     /strain="C57BL/6J"
                     /db_xref="FANTOM_DB:D930038E20"
                     /db_xref="MGI:2424012"
                     /db_xref="taxon:10090"
                     /clone="D930038E20"
                     /tissue_type="head"
                     /clone_lib="RIKEN full-length enriched mouse cDNA library"
                     /dev_stage="15 days embryo"
     CDS             512..2254
                     /note="unnamed protein product; putative
                     solute carrier family 5 (choline transporter), member 7
                     (MGD|MGI:1927126, GB|NM_022025, evidence: BLASTN, 99%,
                     match=1743)"
                     /codon_start=1
                     /protein_id="BAC35253.1"
                     /db_xref="GI:26343193"
                     /translation="MSFHVEGLVAIILFYLLIFLVGIWAAWKTKNSGNPEERSEAIIV
                     GGRDIGLLVGGFTMTATWVGGGYINGTAEAVYGPGCGLAWAQAPIGYSLSLILGGLFF
                     AKPMRSKGYVTMLDPFQQIYGKRMGGLLFIPALMGEMFWAAAIFSALGATISVIIDVD
                     VNISVIVSALIAILYTLVGGLYSVAYTDVVQLFCIFIGLWISVPFALSHPAVTDIGFT
                     AVHAKYQSPWLGTIESVEVYTWLDNFLLLMLGGIPWQAYFQRVLSSSSATYAQVLSFL
                     AAFGCLVMALPAICIGAIGASTDWNQTAYGYPDPKTKEEADMILPIVLQYLCPVYISF
                     FGLGAVSAAVMSSADSSILSASSMFARNIYQLSFRQNASDKEIVWVMRITVLVFGASA
                     TAMALLTKTVYGLWYLSSDLVYIIIFPQLLCVLFIKGTNTYGAVAGYIFGLFLRITGG
                     EPYLYLQPLIFYPGYYSDKNGIYNQRFPFKTLSMVTSFFTNICVSYLAKYLFESGTLP
                     PKLDVFDAVVARHSEENMDKTILVRNENIKLNELAPVKPRQSLTLSSTFTNKEALLDV
                     DSSPEGSGTEDNLQ"



ORIGIN 



  Query Match             78.9%;  Score 1375;  DB 11;  Length 4097;
  Best Local Similarity   86.8%;  Pred. No. 0;
  Matches 1513;  Conservative    0;  Mismatches  230;  Indels    0;  Gaps    0;



Qy          1 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60
              ||| ||||||| || ||||||||| ||||||| ||| | |||||||| || || || |||
Db        512 ATGTCTTTCCACGTAGAAGGACTGGTAGCTATTATCCTCTTCTACCTCCTTATATTTCTG 571

Qy         61 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 120
              ||||||||||||||||| |||| |||||||||||| |||| | ||||||||||||| |||
Db        572 GTTGGAATATGGGCTGCATGGAAAACCAAAAACAGCGGCAACCCAGAAGAGCGCAGTGAA 631

Qy        121 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 180
              ||||||||||| || ||||| || |||||||| |||||||||||| ||||||||||||||
Db        632 GCCATCATAGTCGGGGGCCGTGACATTGGTTTGTTGGTTGGTGGTTTTACCATGACAGCC 691

Qy        181 ACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTAT 240
              |||||||| |||||||| || |||||||| ||||| |||||||| ||||  ||||||| |
Db        692 ACCTGGGTTGGAGGAGGCTACATCAATGGGACAGCAGAAGCAGTGTATGGGCCAGGTTGT 751

Qy        241 GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 300
              || |||||||||||||||||||| |||||||||||||| ||||| ||||||||||| |||
Db        752 GGTCTAGCTTGGGCTCAGGCACCCATTGGATATTCTCTGAGTCTAATTTTAGGTGGTCTG 811

Qy        301 TTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAA 360
              || ||||| |||||||||||||| ||||| |||||||| ||||||||||| ||||| ||
Db        812 TTTTTTGCGAAACCTATGCGTTCCAAGGGATATGTGACTATGTTAGACCCATTTCAACAG 871

Qy        361 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTC 420
              ||||||||||| |||||||| || || || || || |||||||||||||||| ||||||
Db        872 ATCTATGGAAAGCGCATGGGTGGGCTGCTCTTCATCCCTGCACTGATGGGAGAGATGTTC 931

Qy        421 TGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGAT 480
              ||||||||||||||||||||||| || || |||||||||||||||||||| |||||||||
Db        932 TGGGCTGCAGCAATTTTCTCTGCATTAGGGGCCACCATCAGCGTGATCATTGATGTGGAT 991

Qy        481 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG 540

Db        992

Qy        541 CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGG 600
              ||||| |||||||| || |||||||| || ||||| || ||||||||| |||| ||||||
Db       1052 CTCTACTCTGTGGCATATACTGATGTTGTCCAGCTATTCTGCATTTTTATAGGACTGTGG 1111

Qy        601 ATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTG 660
              ||||| ||||| |||||  ||||||||||||||||| | |||||||| ||||| ||||||
Db       1112 ATCAGTGTCCCTTTTGCCCTGTCACATCCTGCAGTCACCGACATCGGATTCACAGCTGTG 1171

Qy        661 CATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGG 720
              ||||| |||||||| |  || |||||||||||  |||| |||  |||||||||| | |||
Db       1172 CATGCTAAATACCAGAGTCCCTGGCTGGGAACCATTGAATCAGTTGAAGTCTACACCTGG 1231

Qy        721 CTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGG 780
              ||||||| ||||||||| ||||||||||||||||||||||||||||| ||||| ||||||
Db       1232 CTTGATAATTTTCTGTTATTGATGCTGGGTGGAATCCCATGGCAAGCCTACTTCCAGAGG 1291

Qy        781 GTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGG 840
              || |||||||| ||||||||||||||||||||| || |||||||||||||||||||| |||
Db       1292 GTCCTCTCTTCATCCTCAGCCACCTATGCTCAGGTACTGTCCTTCCTGGCAGCTTTTGGG 1351

Qy        841 TGCCTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGG 900
              ||||||||||||||  | || ||||||  ||| || || |||||||| || |||||||||
Db       1352 TGCCTGGTGATGGCTCTACCCGCCATATGCATAGGAGCTATTGGAGCTTCCACAGACTGG 1411

Qy        901 AACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTA 960
              ||||||||||| || |||  |||||||||||||||||  || || |||||||||||| |
Db       1412 AACCAGACTGCCTACGGGTATCCAGATCCCAAGACTAAGGAGGAAGCAGACATGATTCTC 1471

Qy        961 CCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTT 1020
              || || ||||||||||| |||||||||||||| || || |||||||| |||||||| |||
Db       1472 CCGATCGTTCTGCAGTACCTCTGCCCTGTGTACATCTCCTTCTTTGGGCTTGGTGCTGTT 1531

Qy       1021 TCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCA 1080
              || |||||||| ||||| ||||| || || |||||| |||| || ||||| ||||||||
Db       1532 TCAGCTGCTGTCATGTCCTCAGCTGACTCGTCCATCCTGTCGGCGAGTTCTATGTTTGCT 1591

Qy       1081 CGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTT 1140
              ||||| ||||||||||||||||||||||||||||| || ||||| ||||| || |||||
Db       1592 CGGAATATCTACCAGCTTTCCTTCAGACAAAATGCATCAGACAAGGAAATTGTGTGGGTC 1651

Qy       1141 ATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAA 1200
              ||| | ||||| ||| ||||||| ||||||||||||||||||||||| |||||||||||
Db       1652 ATGAGGATCACTGTGCTTGTGTTCGGAGCATCTGCAACAGCCATGGCTTTGCTGACGAAG 1711

Qy       1201 ACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAG 1260
              ||||||||||||||||||||||| || ||||||||||| |||||| | |||||||| |||
Db       1712 ACTGTGTATGGGCTCTGGTACCTGAGCTCTGACCTTGTCTACATCATCATCTTCCCACAG 1771

Qy       1261 CTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTT 1320
              ||||| |||||||||||  | || ||||||||||| |||||||| || || |||||| ||
Db       1772 CTGCTCTGTGTACTCTTCATCAAAGGAACCAACACTTATGGGGCAGTTGCTGGTTATATT 1831

Qy       1321 TCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATC 1380
              | ||| || ||||||||||| |||||||| ||||||||||| ||  | |||||||| |||
Db       1832 TTTGGACTATTCCTGAGAATTACTGGAGGAGAGCCATATCTATACTTGCAGCCCTTAATC 1891

Qy       1381 TTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAATTTCCATTTAAA 1440
              ||||||||||| |||||| ||||  | ||||||||||| |||||||  || |||||||||
Db       1892 TTCTACCCTGGTTATTACTCTGACAAGAATGGTATATACAATCAGAGGTTCCCATTTAAA 1951

Qy       1441 ACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTAT 1500
              || ||  |||||||||| |||||||| |||||||||||  | || |||||||||||||||
Db       1952 ACTCTCTCCATGGTTACCTCATTCTTTACCAACATTTGTGTTTCTTATCTAGCCAAGTAT 2011

Qy       1501 CTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGA 1560
              ||||||||||||||||||||||| || |||||||||||||||||||||||||| |||||
Db       2012 CTATTTGAAAGTGGAACCTTGCCTCCAAAATTAGATGTATTTGATGCTGTTGTCGCAAGG 2071

Qy       1561 CACAGTGAAGAAAACATGGATAAGACAATTCTTGTCAAAAATGAAAATATTAAATTAGAT 1620
              ||||||||||| |||||||| ||||| ||||| |||| |||||||||||| |||||| ||
Db       2072 CACAGTGAAGAGAACATGGACAAGACCATTCTAGTCAGAAATGAAAATATCAAATTAAAT 2131

Qy       1621 GAACTTGCACTTGTGAAGCCACGACAGAGCATGACCCTCAGCTCAACTTTCACCAATAAA 1680
              |||||||||| |||||| || || |||||| | ||||||| |||||||||||||||||
Db       2132 GAACTTGCACCTGTGAAACCTCGGCAGAGCCTAACCCTCAGTTCAACTTTCACCAATAAG 2191

Qy       1681 GAGGCCTTCCTTGATGTTGATTCCAGTCCAGAAGGGTCTGGGACTGAAGATAATTTACAG 1740
              |||||| |||||||||||||||||||||| || |||||||||||||||||||| |||||
Db       2192 GAGGCCCTCCTTGATGTTGATTCCAGTCCGGAGGGGTCTGGGACTGAAGATAACTTACAA 2251

Qy       1741 TGA 1743
              |||
Db       2252 TGA 2254
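The RESULT 3 statistics are internally consistent, which can be shown with a little arithmetic. The sketch below recomputes Best Local Similarity as matches divided by aligned columns, and then backs out the per-mismatch penalty implied by the score. The penalty value is an inference from the printed numbers, under the assumption that each match scores +1 and every mismatch is penalised equally. The report does not list the contents of the IDENTITY_NUC table.

```python
# Figures copied from the RESULT 3 statistics block.
matches, mismatches, indels = 1513, 230, 0
score = 1375

aligned_columns = matches + mismatches + indels
best_local_similarity = round(100.0 * matches / aligned_columns, 1)

# ASSUMPTION: score = matches * 1.0 - mismatches * penalty (no gaps here).
implied_mismatch_penalty = (matches - score) / mismatches

print(aligned_columns)            # prints 1743
print(best_local_similarity)      # prints 86.8
print(round(implied_mismatch_penalty, 3))
```

The aligned length of 1743 equals the query length and the span 512..2254 of the database sequence, so the alignment covers the full query without gaps.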



RESULT 4 
AK034415 

LOCUS AK034415 4306 bp mRNA linear HTC 18-SEP-2003 

DEFINITION Mus musculus adult male diencephalon cDNA, RIKEN full-length 
enriched library, clone:9330188K24 product:solute carrier family 5 
(choline transporter), member 1, full insert sequence. 
ACCESSION AK034415 

VERSION AK034415.1 GI:26329926 

KEYWORDS HTC; CAP trapper. 
SOURCE Mus musculus (house mouse) 

ORGANISM Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus. 
REFERENCE 1 

AUTHORS Carninci,P. and Hayashizaki, Y. 
TITLE High-efficiency full-length cDNA cloning 

JOURNAL Meth. Enzymol . 303, 19-44 (1999) 
MEDLINE 99279253 
PUBMED 10349636 
REFERENCE 2 

AUTHORS Carninci,P., Shibata,Y., Hayatsu,N., Sugahara,Y., Shibata,K., 

Itoh,M., Konno,H., Okazaki,Y., Muramatsu,M. and Hayashizaki, Y. 
TITLE Normalization and subtraction of cap-trapper-selected cDNAs to 

prepare full-length cDNA libraries for rapid discovery of new genes 
JOURNAL Genome Res. 10 (10), 1617-1630 (2000) 
MEDLINE 20499374 
PUBMED 11042159 
REFERENCE 3 

AUTHORS Shibata,K., Itoh,M., Aizawa,K., Nagaoka,S., Sasaki,N., Carninci,P., 
Konno,H., Akiyama,J., Nishi,K., Kitsunai,T., Tashiro,H., Itoh,M., 
Sumi,N., Ishii,Y., Nakamura,S., Hazama,M., Nishine,T., Harada,A., 
Yamamoto,R., Matsumoto,H., Sakaguchi,S., Ikegami,T., Kashiwagi,K., 
Fujiwake,S., Inoue,K., Togawa,Y., Izawa,M., Ohara,E., Watahiki,M., 
Yoneda,Y., Ishikawa,T., Ozawa,K., Tanaka,T., Matsuura,S., Kawai,J., 
Okazaki,Y., Muramatsu,M., Inoue,Y., Kira,A. and Hayashizaki,Y. 

TITLE RIKEN integrated sequence analysis (RISA) system — 384-format 

sequencing pipeline with 384 multicapillary sequencer 

JOURNAL Genome Res. 10 (11), 1757-1771 (2000) 

MEDLINE 20530913 
PUBMED 11076861 
REFERENCE 4 

AUTHORS The RIKEN Genome Exploration Research Group Phase II Team and the 

FANTOM Consortium. 

TITLE Functional annotation of a full-length mouse cDNA collection 

JOURNAL Nature 409, 685-690 (2001) 
REFERENCE 5 

AUTHORS The FANTOM Consortium and the RIKEN Genome Exploration Research 

Group Phase I & II Team. 
TITLE Analysis of the mouse transcriptome based on functional annotation 

of 60,770 full-length cDNAs 
JOURNAL Nature 420, 563-573 (2002) 
REFERENCE 6 (bases 1 to 4306) 

AUTHORS Adachi,J., Aizawa,K., Akimura,T., Arakawa,T., Bono,H., Carninci,P., 
Fukuda,S., Furuno,M., Hanagaki,T., Hara,A., Hashizume,W., 
Hayashida,K., Hayatsu,N., Hiramoto,K., Hiraoka,T., Hirozane,T., 
Hori,F., Imotani,K., Ishii,Y., Itoh,M., Kagawa,I., Kasukawa,T., 
Katoh,H., Kawai,J., Kojima,Y., Kondo,S., Konno,H., Kouda,M., 
Koya,S., Kurihara,C., Matsuyama,T., Miyazaki,A., Murata,M., 
Nakamura,M., Nishi,K., Nomura,K., Numazaki,R., Ohno,M., Ohsato,N., 
Okazaki,Y., Saito,R., Saitoh,H., Sakai,C., Sakai,K., Sakazume,N., 
Sano,H., Sasaki,D., Shibata,K., Shinagawa,A., Shiraki,T., 
Sogabe,Y., Tagami,M., Tagawa,A., Takahashi,F., Takaku-Akahira,S., 
Takeda,Y., Tanaka,T., Tomaru,A., Toya,T., Yasunishi,A., 
Muramatsu,M. and Hayashizaki,Y. 
TITLE Direct Submission 

JOURNAL Submitted (16-JUL-2001) Yoshihide Hayashizaki, The Institute of 
Physical and Chemical Research (RIKEN), Laboratory for Genome 
Exploration Research Group, RIKEN Genomic Sciences Center (GSC), 
RIKEN Yokohama Institute; 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, 
Kanagawa 230-0045, Japan (E-mail:genome-res@gsc.riken.go.jp, 
URL:http://genome.gsc.riken.go.jp/, Tel:81-45-503-9222, 
Fax:81-45-503-9216) 
COMMENT cDNA library was prepared and sequenced in Mouse Genome 

Encyclopedia Project of Genome Exploration Research Group in Riken 
Genomic Sciences Center and Genome Science Laboratory in RIKEN. 
Division of Experimental Animal Research in Riken contributed to 
prepare mouse tissues. 

Please visit our web site for further details. 
URL:http://genome.gsc.riken.go.jp/ 
URL:http://fantom.gsc.riken.go.jp/. 
FEATURES Location/Qualifiers 
source 1..4306 
/organism="Mus musculus" 
/mol_type="mRNA" 
/strain="C57BL/6J" 
/db_xref="FANTOM_DB:9330188K24" 
/db_xref="MGI:2398619" 
/db_xref="taxon:10090" 
/clone="9330188K24" 
/sex="male" 
/tissue_type="diencephalon" 
/clone_lib="RIKEN full-length enriched mouse cDNA library" 
/dev_stage="adult" 
CDS 394..2136 
/note="unnamed protein product; putative 
solute carrier family 5 (choline transporter), member 7 
(MGD|MGI:1927126, GB|NM_022025, evidence: BLASTN, 99%, 
match=1743)" 
/codon_start=1 
/protein_id="BAC28702.1" 
/db_xref="GI:26329927" 

/translation="MSFHVEGLVAIILFYLLIFLVGIWAAWKTKNSGNPEERSEAIIV 
GGRDIGLLVGGFTMTATWVGGGYINGTAEAVYGPGCGLAWAQAPIGYSLSLILGGLFF 
AKPMRSKGYVTMLDPFQQIYGKRMGGLLFIPALMGEMFWAAAIFSALGATISVIIDVD 
VNISVIVSALIAILYTLVGGLYSVAYTDVVQLFCIFIGLWISVPFALSHPAVTDIGFT 
AVHAKYQSPWLGTIESVEVYTWLDNFLLLMLGGIPWQAYFQRVLSSSSATYAQVLSFL 
AAFGCLVMALPAICIGAIGASTDWNQTAYGYPDPKTKEEADMILPIVLQYLCPVYISF 
FGLGAVSAAVMSSADSSILSASSMFARNIYQLSFRQNASDKEIVWVMRITVLVFGASA 
TAMALLTKTVYGLWYLSSDLVYIIIFPQLLCVLFIKGTNTYGAVAGYIFGLFLRITGG 
EPYLYLQPLIFYPGYYSDKNGIYNQRFPFKTLSMVTSFFTNICVSYLAKYLFESGTLP 
PKLDVFDAVVARHSEENMDKTILVRNENIKLNELAPVKPRQSLTLSSTFTNKEALLDV 
DSSPEGSGTEDNLQ" 
ORIGIN 



Query Match 78.9%; Score 1375; DB 11; Length 4306; 

Best Local Similarity 86.8%; Pred. No. 0; 

Matches 1513; Conservative 0; Mismatches 230; Indels 0; Gaps 0; 

Qy 1 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60 

III I I I M I I II I M I I I I I I I I I I I I I III I I I I I I I I I II II II III 
Db 394 ATGTCTTTCCACGTAGAAGGACTGGTAGCTATTATCCTCTTCTACCTCCTTATATTTCTG 453 

Qy 61 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 120 

I II M II I I I II I I I II I I II I I I I I I I I I I I I I II I I I I I I I I II I I I I I III 

Db 454 GTTGGAATATGGGCTGCATGGAAAACCAAAAACAGCGGCAACCCAGAAGAGCGCAGTGAA 513 

Qy 121 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 180 

M II I I I I I I I II I II I I II I I I I I I I I I I I I I I I I M I I I M I I I I I I II I I 
Db 514 GCCATCATAGTCGGGGGCCGTGACATTGGTTTGTTGGTTGGTGGTTTTACCATGACAGCC 573 

Qy 181 ACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTAT 240 

II I II I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I II I I I I I M I 

Db 574 ACCTGGGTTGGAGGAGGCTACATCAATGGGACAGCAGAAGCAGTGTATGGGCCAGGTTGT 633 

Qy 241 GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 300 

II II I I I I I M I I I I I I I I I I I I I I I I I I I I I I I II Mill I I I I I I I I I I I III 
Db 634 GGTCTAGCTTGGGCTCAGGCACCCATTGGATATTCTCTGAGTCTAATTTTAGGTGGTCTG 693 

Qy 301 TTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAA 360 

II I I I I I I I I I I I M I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I I II 
Db 694 TTTTTTGCGAAACCTATGCGTTCCAAGGGATATGTGACTATGTTAGACCCATTTCAACAG 753 

Qy 361 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTC 420 

I I I I I I I I I I I I I I I I I I I II II II II II I I I I I I I I I I I II I I I I I I I I I I 
Db 754 ATCTATGGAAAGCGCATGGGTGGGCTGCTCTTCATCCCTGCACTGATGGGAGAGATGTTC 813 

Qy 421 TGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGAT 480 

I I I I I II I II I I I I II I I I I I I I II M I I I I I I I I I I I I I I I I I I I I I I I I I I I M 

Db 814 TGGGCTGCAGCAATTTTCTCTGCATTAGGGGCCACCATCAGCGTGATCATTGATGTGGAT 873 

Qy 481 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG 540 

II I I I I M I I I I I I I I I I I I I I I II I I II I I III II II II I I I I I III 
Db 874 GTGAACATATCGGTCATTGTCTCTGCACTCATTGCCATTCTTTATACCCTAGTGGGTGGG 933 

Qy 541 CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGG 600 

I II I I I I I I II M II I I I I I I I I II I I I I I II I I I I II I I I I I I I I I II I I 
Db 934 CTCTACTCTGTGGCATATACTGATGTTGTCCAGCTATTCTGCATTTTTATAGGACTGTGG 993 

Qy 601 ATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTG 660 

Mill I I I I I Mill I II II I II M I II II I I I M II I I II II I II I II I I I 
Db 994 ATCAGTGTCCCTTTTGCCCTGTCACATCCTGCAGTCACCGACATCGGATTCACAGCTGTG 1053 

Qy 661 CATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGG 720 

I I II I M II I I II I II II I II I I I I I I I I I I III II I II I I I I I I III 
Db 1054 CATGCTAAATACCAGAGTCCCTGGCTGGGAACCATTGAATCAGTTGAAGTCTACACCTGG 1113 

Qy 721 CTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGG 780 

M I I M I I II II I M I I I II I M M II I II II II M I II II II I I Mill I II II I 



Db 1114 CTTGATAATTTTCTGTTATTGATGCTGGGTGGAATCCCATGGCAAGCCTACTTCCAGAGG 1173 

Qy 781 GTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGG 840 

II M I I II I I I I I I I I II M I I I I I I II M II I I I I I I I I I I M I I I I I II i Ml 

Db 1174 GTCCTCTCTTCATCCTCAGCCACCTATGCTCAGGTACTGTCCTTCCTGGCAGCTTTTGGG 1233 

Qy 841 TGCCTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGG 900 

II II I II I I I I II I I II I I I I I I III II II I I I I I M I II I I I I I I I I I 

Db 1234 TGCCTGGTGATGGCTCTACCCGCCATATGCATAGGAGCTATTGGAGCTTCCACAGACTGG 1293 

Qy 901 AACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTA 960 

I I II I I II I I I II III II I I I I I I I I I I I I I I I M II I I I I M I I I I I I I 

Db 1294 AACCAGACTGCCTACGGGTATCCAGATCCCAAGACTAAGGAGGAAGCAGACATGATTCTC 1353 

Qy 961 CCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTT 1020 

II II I I M I I I I I I I I I II I II I I I I II I II II I I I I I I I I M M I I I I III 

Db 1354 CCGATCGTTCTGCAGTACCTCTGCCCTGTGTACATCTCCTTCTTTGGGCTTGGTGCTGTT 1413 

Qy 1021 TCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCA 1080 

II I I I I I I I I I I I II I I I I I II II I I II I I I II I II I I I I I I I II I M I 

Db 1414 TCAGCTGCTGTCATGTCCTCAGCTGACTCGTCCATCCTGTCGGCGAGTTCTATGTTTGCT 1473 

Qy 1081 CGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTT 1140 

I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I II I I I I I 

Db 1474 CGGAATATCTACCAGCTTTCCTTCAGACAAAATGCATCAGACAAGGAAATTGTGTGGGTC 1533 

Qy 1141 ATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAA 1200 

III I I I I I I III I I I I I I I I I I I M I I I M M I I I I I I I I I I I I I I I M I I II 

Db 1534 ATGAGGATCACTGTGCTTGTGTTCGGAGCATCTGCAACAGCCATGGCTTTGCTGACGAAG 1593 

Qy 1201 ACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAG 1260 

I I I M I I I I I I I I I I I I I I I M I II I I I I I I I I I I I II I I I I I I I I M M I III 
Db 1594 ACTGTGTATGGGCTCTGGTACCTGAGCTCTGACCTTGTCTACATCATCATCTTCCCACAG 1653 

Qy 1261 CTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTT 1320 

I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I I II I I I II II I I I I I I II 
Db 1654 CTGCTCTGTGTACTCTTCATCAAAGGAACCAACACTTATGGGGCAGTTGCTGGTTATATT 1713 

Qy 1321 TCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATC 1380 

I I I I II I I I I I I I I I I I I I I M I II I I M I I II I I I II I I I I I I I I I III 

Db 1714 TTTGGACTATTCCTGAGAATTACTGGAGGAGAGCCATATCTATACTTGCAGCCCTTAATC 1773 

Qy 1381 TTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAATTTCCATTTAAA 1440 

I I I I I I I I I I I I I I I I I MM I II I M I II I II I II I I II II I I I II M II 

Db 1774 TTCTACCCTGGTTATTACTCTGACAAGAATGGTATATACAATCAGAGGTTCCCATTTAAA 1833 

Qy 1441 ACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTAT 1500 

II II I M II II I I I I I II II II I M I I I I I I II I II M I I I II II I M I I I 

Db 1834 ACTCTCTCCATGGTTACCTCATTCTTTACCAACATTTGTGTTTCTTATCTAGCCAAGTAT 1893 

Qy 1501 CTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGA 1560 

I I 1 M II I II I II II I II I II I I II I I I II I II I I I M II II II I I M I I I II II I 

Db 1894 CTATTTGAAAGTGGAACCTTGCCTCCAAAATTAGATGTATTTGATGCTGTTGTCGCAAGG 1953 

Qy 1561 CACAGTGAAGAAAACATGGATAAGACAATTCTTGTCAAAAATGAAAATATTAAATTAGAT 1620 

II I I II II I M I I I M I II II II I Mill MM II II I II II I II I I I I M II 

Db 1954 CACAGTGAAGAGAACATGGACAAGACCATTCTAGTCAGAAATGAAAATATCAAATTAAAT 2013 



Qy 1621 GAACTTGCACTTGTGAAGCCACGACAGAGCATGACCCTCAGCTCAACTTTCACCAATAAA 1680 

I I I I I I I I I I I I I I I I II II I II II I I I I I I II I I II I I I I I II I M I I I I I 
Db 2014 GAACTTGCACCTGTGAAACCTCGGCAGAGCCTAACCCTCAGTTCAACTTTCACCAATAAG 2073 

Qy 1681 GAGGCCTTCCTTGATGTTGATTCCAGTCCAGAAGGGTCTGGGACTGAAGATAATTTACAG 1740 

II II II I I I I I I I M I M II I M I I I II II M I I II M I I I I I I I I I I I I I II II 
Db 2074 GAGGCCCTCCTTGATGTTGATTCCAGTCCGGAGGGGTCTGGGACTGAAGATAACTTACAA 2133 

Qy 1741 TGA 1743 

III 

Db 2134 TGA 2136 
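
Note on the summary statistics: the percentages printed with each result appear to follow directly from the counts on the same lines. The sketch below is a cross-check only. It assumes (the report does not state this) that Query Match is the score divided by the perfect score of 1743 given in the run header, and that Best Local Similarity is Matches divided by the number of aligned columns (Matches + Mismatches + Indels). The function names are hypothetical.

```python
# Assumption: "Perfect score: 1743" from the run header is the denominator
# used for the "Query Match" percentage.
PERFECT_SCORE = 1743


def percent(numerator, denominator):
    """Return numerator/denominator as a percentage rounded to one decimal."""
    return round(100.0 * numerator / denominator, 1)


def query_match(score, perfect_score=PERFECT_SCORE):
    """Assumed definition: alignment score relative to the perfect score."""
    return percent(score, perfect_score)


def best_local_similarity(matches, mismatches, indels):
    """Assumed definition: matches over all aligned columns."""
    return percent(matches, matches + mismatches + indels)


if __name__ == "__main__":
    # RESULT 4 (AK034415): Score 1375; Matches 1513; Mismatches 230; Indels 0
    print(query_match(1375), best_local_similarity(1513, 230, 0))
    # RESULT 6 (AG157499): Score 518.8; Matches 549; Mismatches 12; Indels 3
    print(query_match(518.8), best_local_similarity(549, 12, 3))
```

Under these assumptions the computed values agree with the percentages printed for RESULT 4 (78.9% and 86.8%) and RESULT 6 (29.8% and 97.3%), which is a useful sanity check when a statistics line has been damaged in transcription.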
RESULT 5 
AY413300 
LOCUS AY413300 1743 bp DNA linear GSS 12-DEC-2003 
DEFINITION Mus musculus HCM4844 gene, VIRTUAL TRANSCRIPT, partial sequence, 
genomic survey sequence. 
ACCESSION AY413300 
VERSION AY413300.1 GI:39769262 
KEYWORDS GSS. 
SOURCE Mus musculus (house mouse) 
ORGANISM Mus musculus 
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus. 
REFERENCE 1 (bases 1 to 1743) 
AUTHORS Clark,A.G., Glanowski,S., Nielson,R., Thomas,P., Kejariwal,A., 
Todd,M.A., Tanenbaum,D.M., Civello,D.R., Lu,F., Murphy,B., 
Ferriera,S., Wang,G., Zheng,X.H., White,T.J., Sninsky,J.J., 
Adams,M.D. and Cargill,M. 
TITLE Inferring nonneutral evolution from human-chimp-mouse orthologous 
gene trios 
JOURNAL Science 302 (5652), 1960-1963 (2003) 
PUBMED 14671302 
REFERENCE 2 (bases 1 to 1743) 
AUTHORS Clark,A.G., Glanowski,S., Nielson,R., Thomas,P., Kejariwal,A., 
Todd,M.A., Tanenbaum,D.M., Civello,D.R., Lu,F., Murphy,B., 
Ferriera,S., Wang,G., Zheng,X.H., White,T.J., Sninsky,J.J., 
Adams,M.D. and Cargill,M. 
TITLE Direct Submission 
JOURNAL Submitted (16-NOV-2003) Celera Genomics, 45 West Gude Drive, 
Rockville, MD 20850, USA 
COMMENT This sequence was made by sequencing genomic exons and ordering 
them based on alignment. 
FEATURES Location/Qualifiers 
source 1..1743 
/organism="Mus musculus" 
/mol_type="genomic DNA" 
/db_xref="taxon:10090" 
gene <1..>1743 
/locus_tag="HCM4844" 
ORIGIN 



Query Match 66.3%; Score 1156.2; DB 29; Length 1743; 

Best Local Similarity 73.1%; Pred. No. 3.8e-304; 

Matches 1275; Conservative 0; Mismatches 468; Indels 0; Gaps 0; 



Qy 1 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60 

III II II I I I II M II I I I M M I M I I III I I I M II II II II II III 

Db 1 ATGTCTTTCCACGTAGAAGGACTGGTAGCTATTATCCTCTTCTACCTCCTTATATTTCTG 60 

Qy 61 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 120 

II II I I I M II II II M I I I I II I M II I M M I I I I I MINIMI III III 

Db 61 GTTGGAATATGGGCTGCATGGAAAACCAAAAACAGCGGCAACCCAGAAGAGCACAGTGAA 120 

Qy 121 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 180 

II I M M II II II I II I I II I I M M M I I I M I I II II M M M I M I M I 

Db 121 GCCATCATAGTCGGGGGCCGTGACATTGGTTTGTTGGTTGGTGGTTTTACCATGACAGNN 180 

Qy 181 ACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTAT 240 

Db 181 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 240 

Qy 241 GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 300 

Db 241 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 300 

Qy 301 TTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAA 360 

Db 301 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 360 

Qy 361 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTC 420 

Db 361 NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN 420 

Qy 421 TGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGAT 480 

I II I M I II II II I I M II M I II M I II I 
Db 421 NNNNNNNNNNNNNNNNNNNNNNNNNNNNGGGCCACCATCAGCGTGATCATTGATGTGGAT 480 

Qy 481 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG 540 

II MM II Mill I M M I M M I M M II I III II II II I M II Ml 

Db 481 GTGAACATATCGGTCATTGTCTCTGCACTCATTGCCATTCTTTATACCCTAGTGGGTGGG 540 

Qy 541 CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGG 600 

Mill I I II II II II II II I II I II Mill II II M I II II II II II I I M 

Db 541 CTCTACTCTGTGGCATATACTGATGTTGTCCAGCTATTCTGCATTTTTATAGGACTGTGG 600 

Qy 601 ATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTG 660 

II II I II II I II I II I I I II M I I II I II I II I II I M I II II II I I II M I 
Db 601 ATCAGTGTCCCTTTTGCCCTGTCACATCCTGCAGTCACCGACATCGGATTCACAGCTGTG 660 

Qy 661 CATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGG 720 

II II I I II II I II I II II II II I M II II I I II I II II I II II I I II I 
Db 661 CATGCTAAATACCAGAGTCCCTGGCTGGGAACCATTGAATCAGTTGAAGTCTACACCTGG 720 

Qy 721 CTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGG 780 

I M M M II II I M I I M II II II II II II I I M II M II II M I Mill M I I M 

Db 721 CTTGATAATTTTCTGTTATTGATGCTGGGTGGAATCCCATGGCAAGCCTACTTCCAGAGG 780 

Qy 781 GTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGG 840 

II I II II I M II II II II II I II II II I I I II M I I I II I M I I M I I II II III 

Db 781 GTCCTCTCTTCATCCTCAGCCACCTATGCTCAGGTACTGTCCTTCCTGGCAGCTTTTGGG 840 

Qy 841 TGCCTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGG 900 

I I I I I I M I I I M I I II MINI III II II II I II I II II I II I I I I I I 

Db 841 TGCCTGGTGATGGCTCTACCCGCCATATGCATAGGAGCTATTGGAGCTTCCACAGACTGG 900 
Qy 901 AACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTA 960 

II I II M II II II III I I M I II I I I I I I M I I M II I II I M II M I I I 

Db 901 AACCAGACTGCCTACGGGTATCCAGATCCCAAGACTAAGGAGGAAGCAGACATGATTCTC 960 

Qy 961 CCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTT 1020 

II II I I I I I I I I I II I I I II I I I I I I I I I II II I I I I I I I I I I M I II I III 
Db 961 CCGATCGTTCTGCAGTACCTCTGCCCTGTGTACATCTCCTTCTTTGGGCTTGGTGCTGTT 1020 

Qy 1021 TCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCA 1080 

II I I M II I I I I I II I I I I I II II I I I I I I MM II Mill II II II II 
Db 1021 TCAGCTGCTGTCATGTCCTCAGCTGACTCGTCCATCCTGTCGGCGAGTTCTATGTTTGCT 1080 

Qy 1081 CGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTT 1140 

Mill II I I M I I I I I I II I I I I I I II I I M I II II Mill II I I I II I II II 
Db 1081 CGGAATATCTACCAGCTTTCCTTCAGACAAAATGCATCAGACAAGGAAATTGTGTGGGTC 1140 

Qy 1141 ATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAA 1200 

IN I Mill III II II II I II I II I II I II I I I I I II II I II II II I II M I I 
Db 1141 ATGAGGATCACTGTGCTTGTGTTCGGAGCATCTGCAACAGCCATGGCTTTGCTGACGAAG 1200 

Qy 1201 ACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAG 1260 

I I I I II I I II M I I I II I I I I II II II II I II II I I II I I M I M I I II I I Ml 
Db 1201 ACTGTGTATGGGCTCTGGTACCTGAGCTCTGACCTTGTCTACATCATCATCTTCCCACAG 1260 

Qy 1261 CTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTT 1320 

Mill II II II M II I I II I II I II I II II M II M II II M I I I II I II 

Db 1261 CTGCTCTGTGTACTCTTCATCAAAGGAACCAACACTTATGGGGCAGTTGCTGGTTATATT 1320 

Qy 1321 TCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATC 1380 

I I It M II II I I II II I I M II I I I I I I I I I I I I II II I M I II I I I III 
Db 1321 TTTGGACTATTCCTGAGAATTACTGGAGGAGAGCCATATCTATACTTGCAGCCCTTAATC 1380 

Qy 1381 TTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAATTTCCATTTAAA 1440 

M II M II II I II II II II II I II I I II I II II II M II I II II I M II II 

Db 1381 TTCTACCCTGGTTATTACTCTGACAAGAATGGTATATACAATCAGAGGTTCCCATTTAAA 1440 

Qy 1441 ACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTAT 1500 

M II I II I M I II I II II II I I I I I I I M II I I I II II M I I II II I I II I 

Db 1441 ACTCTCTCCATGGTTACCTCATTCTTTACCAACATTTGTGTTTCTTATCTAGCCAAGTAT 1500 

Qy 1501 CTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGA 1560 

Mill ill I II II I II II I II I I II II I I II I I I M I I I II II II I I I I M II Ml 
Db 1501 CTATTTGAAAGTGGAACCTTGCCTCCAAAATTAGATGTATTTGATGCTGTTGTCGCAAGG 1560 

Qy 1561 CACAGTGAAGAAAACATGGATAAGACAATTCTTGTCAAAAATGAAAATATTAAATTAGAT 1620 

M M II II I II I I II I II I MM! II I II I I II M II I II I II II II I M I II 
Db 1561 CACAGTGAAGAGAACATGGACAAGACCATTCTAGTCAGAAATGAAAATATCAAATTAAAT 1620 

Qy 1621 GAACTTGCACTTGTGAAGCCACGACAGAGCATGACCCTCAGCTCAACTTTCACCAATAAA 1680 

M M II II M II I II I II II I I I I M I I I M II M M II II M M I II I II I 
Db 1621 GAACTTGCACCTGTGAAACCTCGGCAGAGCCTAACCCTCAGTTCAACTTTCACCAATAAG 1680 

Qy 1681 GAGGCCTTCCTTGATGTTGATTCCAGTCCAGAAGGGTCTGGGACTGAAGATAATTTACAG 1740 



MINI I I I I II I I I I I I II I M I I II I II I II I M I I I I I I I II I I I II III II 

Db 1681 GAGGCCCTCCTTGATGTTGATTCCAGTCCGGAGGGGTCTGGGACTGAAGATAACTTACAA 1740 

Qy 1741 TGA 1743 

I I I 

Db 1741 TGA 1743 



RESULT 6 
AG157499 
LOCUS AG157499 672 bp DNA linear GSS 09-JAN-2002 
DEFINITION Pan troglodytes DNA, clone: RP43-022H02.T7, genomic survey 
sequence. 
ACCESSION AG157499 
VERSION AG157499.1 GI:16687177 
KEYWORDS GSS. 
SOURCE Pan troglodytes (chimpanzee) 
ORGANISM Pan troglodytes 
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Pan. 
REFERENCE 1 
AUTHORS Fujiyama,A., Hattori,M., Toyoda,A., Taylor,T.D., Yada,T., 
Totoki,Y., Watanabe,H. and Sakaki,Y. 
TITLE BAC end sequences of Library RPCI-43 
JOURNAL Unpublished 
REFERENCE 2 (bases 1 to 672) 
AUTHORS Fujiyama,A., Hattori,M., Toyoda,A., Taylor,T.D., Yada,T., 
Totoki,Y., Watanabe,H. and Sakaki,Y. 
TITLE Direct Submission 
JOURNAL Submitted (02-AUG-2001) Asao Fujiyama, The Institute of Physical 
and Chemical Research (RIKEN), Genomic Sciences Center (GSC); 
1-7-22 Suehiro-chou, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan 
(E-mail:chimpbes@gsc.riken.go.jp, URL:http://hgp.gsc.riken.go.jp/, 
Tel:81-45-503-9111, Fax:81-45-503-9170) 
COMMENT Clones are derived from the chimpanzee BAC library RPCI-43 This BAC 
end was generated during the R&D process and may have higher chance 
of clone tracking errors. 
PRIMERS 
Sequencing: T7 
LIBRARY 
Vector : pBACe3.6 
R.Site 1 : EcoRI 
R.Site 2 : EcoRI. 
FEATURES Location/Qualifiers 
source 1..672 
/organism="Pan troglodytes" 
/mol_type="genomic DNA" 
/db_xref="taxon:9598" 
/clone="RP43-022H02.T7" 
/sex="male" 
/cell_type="lymphocytes" 
/clone_lib="RPCI-43 Chimpanzee Male BAC Library" 
ORIGIN 



Query Match 29.8%; Score 518.8; DB 29; Length 672; 

Best Local Similarity 97.3%; Pred. No. 3.5e-130; 

Matches 549; Conservative 0; Mismatches 12; Indels 3; Gaps 2; 



Qy 1110 AAATGCTTCGGACAAAGAAATCGTTTGGGTTATGCGAATCACAGTGTTTGTGTTTGGAGC 1169 

I I I I I I I I I I I I I I I I M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 111 ACAGGCTTCGGACAAAGAAATCGTTTGGGTTATGCGAATCACAGTGTTTGTGTTTGGAGC 170 

Qy 1170 ATCTGCAACAGCCATGGCCTTGCTGACGAAAACTGTGTATGGGCTCTGGTACCTCAGTTC 1229 

I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 171 ATCTGCAACAGCCATGGCCTTGCTGACGAAGACTGTGTATGGGCTCTGGTACCTCAGTTC 230 

Qy 1230 TGACCTTGTTTACATCGTTATCTTCCCCCAGCTGCTTTGTGTACTCTTTGTTAAGGGAAC 1289 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 231 TGACCTTGTTTACATCGTTATCTTCCCCCAGCTGCTTTGTGTACTCTTTGTTAAGGGAAC 290 

Qy 1290 CAACACCTATGGGGCCGTGGCAGGTTATGTTTCTGGCCTCTTCCTGAGAATAACTGGAGG 1349 

I I M I I I I M I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I 
Db 291 CAACACCTATGGGGCCGTGGCAGGTTATGTTTCTGGCCTCTTCCTGAGAATAACTGGAGG 350 

Qy 1350 GGAGCCATATCTGTATCTTCAGCCCTTGATCTTCTACCCTGGCTATTACCCTGATGATAA 1409 

I I I M I I I I I I I I I I I M I I I I I I I I I I I I I I M I I M I I I I I I I I I I I I I I I I I M I M 

Db 351 GGAGCCATATCTGTATCTTCAGCCCTTGATCTTCTACCCTGGCTATTACCCTGATGATAA 410 

Qy 1410 TGGTATATATAATCAGAAATTTCCATTTAAAACACTTGCCATGGTTACATCATTCTTAAC 1469 

I M I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I i I I I I I 
Db 411 TGGTATATATAATCAGAAATTTCCATTTAAAACACTTGCCATGGTTACGTCATTCTTAAC 470 

Qy 1470 CAACATTTGCATCTCCTATCTAGCCAAGTATCTATTTG-AAAGTGGAACCTTGCCACCTA 1528 

I I M I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I M I I I I I I I I I I I I I I I I I I I 

Db 471 CAACATTTGCGTCTCCTATCTAGCCAAATATCTATTTGAAAAGTGGAACCTTGCCACCTA 530 

Qy 1529 AATTAGATGTATTTGATGCTGTTGTTGCAAGACACAGTGAAGAAAACATGGATAAGACAA 1588 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I 

Db 531 AATTAGATGTATTTGATGCTGTTGTTGCAAGACACAGTGAAGAAAACATGGATAAGACAA 590 

Qy 1589 TTCTTGTCAAAAATGAAAATATTAAATTAGATGAACTTGCACTTGTGAAGCCACGACAGA 1648 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I 

Db 591 TTTTTGTCAAAAATGAAA--TATAAATTAGATGACCTTGCACTTGTGAAGCCACGACAGA 648 

Qy 1649 GCATGACCCTCAGCTCAACTTTCA 1672 

I I M I I I I I I I I I I I I I I I I I I 
Db 649 ACATGACCCTCAGCTTAACTTTCA 672 



RESULT 7 
CD350164 
LOCUS CD350164 707 bp mRNA linear EST 09-JUL-2003 
DEFINITION UI-M-FYO-cfl-h-lO-O-UI.r1 NIH_BMAP_FY0 Mus musculus cDNA clone 
IMAGE:6851099 5', mRNA sequence. 
ACCESSION CD350164 
VERSION CD350164.1 GI:31141679 
KEYWORDS EST. 
SOURCE Mus musculus (house mouse) 
ORGANISM Mus musculus 
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus. 
REFERENCE 1 (bases 1 to 707) 
AUTHORS NIH-MGC http://mgc.nci.nih.gov/. 
TITLE National Institutes of Health, Mammalian Gene Collection (MGC) 
JOURNAL Unpublished (1999) 
COMMENT Contact: Robert Strausberg, Ph.D. 
Email: cgapbs-r@mail.nih.gov 
Tissue Procurement: Dr. Jim Lin, University of Iowa 
cDNA Library preparation: Dr. M. Bento Soares, University of Iowa 
cDNA Library Arrayed by: Dr. M. Bento Soares, University of Iowa 
DNA Sequencing by: Dr. M. Bento Soares, University of Iowa 
Clone Distribution: Distribution information can be found at 
http://genome.uiowa.edu/distribution/mousef1.html 
This clone was contributed by the Brain Molecular Anatomy Project 
(BMAP) 
Seq primer: pYX-5. 
FEATURES Location/Qualifiers 
source 1..707 
/organism="Mus musculus" 
/mol_type="mRNA" 
/strain="C57BL/6" 
/db_xref="taxon:10090" 
/clone="IMAGE:6851099" 
/tissue_type="whole brain" 
/dev_stage="embryo 13.5,14.5,16.5,17.5dpc" 
/lab_host="DH10B (T1 phage resistant)" 
/clone_lib="NIH_BMAP_FYO" 
/note="Organ: Brain; Vector: pYX-Asc; Site_1: EcoR I; 
Site_2: Not I; The library was constructed according 
Bonaldo, Lennon and Soares, Genome Research, 6:791-806, 
1996. Denatured RNA was size fractionated on a 1% agarose 
gel. First strand cDNA synthesis was primed with oligo-dT 
primer containing a Not I site. Double strand cDNA was 
size selected according to mRNA size fraction, ligated 
with EcoR I adaptor, digested with NotI and then cloned 
directionally into pYX-Asc vector. The library tag 
sequence located between the Not I site and the polyA tail 
is AGCGAGACAG. This library was created for the University 
Iowa Brain Anatomy Project (BMAP): 'Gene Discovery in the 
Developing Mouse Nervous System', supported by National 
Institute of Mental Health (NIMH), Hemin Chin, Ph.D., 
program coordinator." 

ORIGIN 



Query Match 27.1%; Score 472.2; DB 14; Length 707; 

Best Local Similarity 86.2%; Pred. No. 1.9e-117; 

Matches 580; Conservative 0; Mismatches 88; Indels 5; Gaps 5; 

Qy 1 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60 

III I I I II I I II I I II I I I I I I I II I I I Mi l I I I I I I I I II II M Ml 

Db 27 ATGTCTTTCCACGTAGAAGGACTGGTAGCTATTATCCTCTTCTACCTCCTTATATTTCTG 86 

Qy 61 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 120 

M II M II II II II I II MM M II II II M M II II I II II II M I M I I III 
Db 87 GTTGGAATATGGGCTGCATGGAAAACCAAAAACAGCGGCAACCCAGAAGAGCGCAGTGAA 146 



Qy 121 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 180 

I I I I I I M M I II II I II II I II I 11 I I II II I M II II I M I M I I M M I I 

Db 147 GCCATCATAGTCGGGGGCCGTGACATTGGTTTGTTGGTTGGTGGTTTTACCATGACAGCC 206 

Qy 181 ACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTAT 240 



I I I I I I I I I I I I I I I I M I M I I I I I I I M I M I I I I M I I I I I M I I I I I 

Db 207 ACCTGGGTTGGAGGAGGCTACATCAATGGGACAGCAGAAGCAGTGTATGGGCCAGGTTGT 266 

Qy 241 GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 300 

II I I I II I II I M II I I I I I II I I I I II I I M I M I Mill II I I I I I I I I I III 

Db 267 GGTCTAGCTTGGGCTCAGGCACCCATTGGATATTCTCTGAGTCTAATTTTAGGTGGTCTG 326 

Qy 301 TTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAA 360 

M I I M I I I I I I II I II M II I I I I I I I II I II I I I I I M I I II I Mill M 

Db 327 TTTTTTGCGAAACCTATGCGTTCCAAGGGATATGTGACTATGTTAGACCCATTTCAACAG 386 

Qy 361 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGG-AGAAATGTT 419 

M I II I M I II M M II I I II II II M II II II II II II II II III Mill 

Db 387 ATCTATGGAAAGCGCATGGGTGGGCTGCTCTTCATCCCTGCACTGATGGGNAGAGATGTT 446 

Qy 420 CTGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCA-TCAGCGTGATCA-TCGATGTG 477 

I I M II II 11 I I M II II II II I I II II 11 M II I 11 II I 11 II II I I I 11 I II 

Db 447 CTGGGCTGCAGCAATTTTCTCTGCATTAGGGGCCACCATTCAGCGTGATCATTGGATGTG 506 

Qy 478 GATATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGA 537 

Ml II I II I II II II I M II II I I II II II I II I III II II II Mill 
Db 507 GATGTGAACATATCGGTCATTGTCTCTGCACTCATTGCCATTCTTTATACCCTAGTGGGT 566 

Qy 538 GGGCTCTATTCTGTGGCCTACACT-GATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCT 596 

M I II II I II II II II II III I I II I M II II I II II II I II II II II II 
Db 567 GGGCTGTACTCTGTGGCATATACTGGATGTTGTCCAGCTATTCTGCATTTTTATAGGACT 626 

Qy 597 GT-GGATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTG 655 

II I I M II I II II I Mill II I II II II II II II II I II II M II II I I I I 

Db 627 GTGGGATCAGTGTCCCTTTTGCCCTGTCACATCCTGCAGTCACCGACATCGGATTCACAG 686 

Qy 656 CTGTGCATGCCAA 668 

I II I M II II II 
Db 687 CTGTGCATGCTAA 699 



RESULT 8 
BY727598 
LOCUS BY727598 669 bp mRNA linear EST 17-DEC-2002 
DEFINITION BY727598 RIKEN full-length enriched, 6 days neonate medulla 
oblongata Mus musculus cDNA clone B730003H24 5', mRNA sequence. 
ACCESSION BY727598 
VERSION BY727598.1 GI:27140725 
KEYWORDS EST. 
SOURCE Mus musculus (house mouse) 
ORGANISM Mus musculus 
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus. 
REFERENCE 1 (bases 1 to 669) 
AUTHORS Okazaki,Y., Furuno,M., Kasukawa,T., Adachi,J., Bono,H., Kondo,S., 
Nikaido,I., Osato,N., Saito,R., Suzuki,H., Yamanaka,I., 
Kiyosawa,H., Yagi,K., Tomaru,Y., Hasegawa,Y., Nogami,A., 
Schonbach,C., Gojobori,T., Baldarelli,R., Hill,D.P., Bult,C., 
Hume,D.A., Quackenbush,J., Schriml,L.M., Kanapin,A., Matsuda,H., 
Batalov,S., Beisel,K.W., Blake,J.A., Bradt,D., Brusic,V., 
Chothia,C., Corbani,L.E., Cousins,S., Dalla,E., Dragani,T.A., 
Fletcher,C.F., Forrest,A., Frazer,K.S., Gaasterland,T., 
Gariboldi,M., Gissi,C., Godzik,A., Gough,J., Grimmond,S., 
Gustincich,S., Hirokawa,N., Jackson,I.J., Jarvis,E.D., Kanai,A., 
Kawaji,H., Kawasawa,Y., Kedzierski,R.M., King,B.L., Konagaya,A., 
Kurochkin,I.V., Lee,Y., Lenhard,B., Lyons,P.A., Maglott,D.R., 
Maltais,L., Marchionni,L., McKenzie,L., Miki,H., Nagashima,T., 
Numata,K., Okido,T., Pavan,W.J., Pertea,G., Pesole,G., 
Petrovsky,N., Pillai,R., Pontius,J.U., Qi,D., Ramachandran,S., 
Ravasi,T., Reed,J.C., Reed,D.J., Reid,J., Ring,B.Z., Ringwald,M., 
Sandelin,A., Schneider,C., Semple,C.A., Setou,M., Shimada,K., 
Sultana,R., Takenaka,Y., Taylor,M.S., Teasdale,R.D., Tomita,M., 
Verardo,R., Wagner,L., Wahlestedt,C., Wang,Y., Watanabe,Y., 
Wells,C., Wilming,L.G., Wynshaw-Boris,A., Yanagisawa,M., Yang,I., 
Yang,L., Yuan,Z., Zavolan,M., Zhu,Y., Zimmer,A., Carninci,P., 
Hayatsu,N., Hirozane-Kishikawa,T., Konno,H., Nakamura,M., 
Sakazume,N., Sato,K., Shiraki,T., Waki,K., Kawai,J., Aizawa,K., 
Arakawa,T., Fukuda,S., Hara,A., Hashizume,W., Imotani,K., Ishii,Y., 
Itoh,M., Kagawa,I., Miyazaki,A., Sakai,K., Sasaki,D., Shibata,K., 
Shinagawa,A., Yasunishi,A., Yoshino,M., Waterston,R., Lander,E.S., 
Rogers,J., Birney,E. and Hayashizaki,Y. 

TITLE Analysis of the mouse transcriptome based on functional annotation
of 60,770 full-length cDNAs
JOURNAL Nature 420, 563-573 (2002)
MEDLINE 22354683
PUBMED 12466851
COMMENT Contact: Yoshihide Hayashizaki
Laboratory for Genome Exploration Research Group, RIKEN Genomic
Sciences Center (GSC), Yokohama Institute
The Institute of Physical and Chemical Research (RIKEN)
1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan
Tel: 81-45-503-9222
Fax: 81-45-503-9216
Email: genome-res@gsc.riken.go.jp,
URL: http://genome.gsc.riken.go.jp/

Adachi,J., Aizawa,K., Akimura,T., Arakawa,T., Carninci,P.,
Fukuda,S., Hashizume,W., Hayashida,K., Hirozane,T., Hori,F.,
Imotani,K., Ishii,Y., Itoh,M., Kagawa,I., Kawai,J., Kojima,Y.,
Kondo,S., Konno,H., Koya,S., Miyazaki,A., Murata,M., Nakamura,M.,
Nomura,K., Numazaki,R., Ohno,M., Ohsato,N., Saito,R., Sakazume,N.,
Sano,H., Sasaki,D., Sato,K., Shibata,K., Shiraki,T., Tagami,M.,
Takeda,Y., Waki,K., Watahiki,A., Muramatsu,M. and Hayashizaki,Y.
Direct Submission

Computational Analysis of Full-Length Mouse cDNAs Compared with
Human Genome Sequences. Mamm. Genome. 12, 673-677 (2001)

Normalization and subtraction of cap-trapper-selected cDNAs to 
prepare full-length cDNA libraries for rapid discovery of new 
genes. Genome Res. 10 (10), 1617-1630 (2000) 

RIKEN integrated sequence analysis (RISA) system — 384-format 
sequencing pipeline with 384 multicapillary sequencer. Genome Res. 
10 (11), 1757-1771 (2000) 

Computer-based methods for the mouse full-length cDNA 
encyclopedia: real-time sequence clustering for construction of a 
nonredundant cDNA library. Genome Res. 11 (2), 281-289 (2001) 

cDNA library was prepared and sequenced in Mouse Genome 
Encyclopedia Project of Genome Exploration Research Group in Riken 
Genomic Sciences Center and Genome Science Laboratory in RIKEN. 
Division of Experimental Animal Research in Riken contributed to 
prepare mouse tissues. 



Please visit our web site (http://genome.gsc.riken.go.jp) for
further details.
FEATURES Location/Qualifiers
source 1..669
/organism="Mus musculus"
/mol_type="mRNA"
/db_xref="taxon:10090"
/clone="B730003H24"
/tissue_type="medulla oblongata"
/dev_stage="6 days neonate"
/lab_host="DH10B"
/clone_lib="RIKEN full-length enriched, 6 days neonate
medulla oblongata"

/note="Site_1: SalI; Site_2: BamHI; cDNA library was
prepared and sequenced in Mouse Genome Encyclopedia
Project of Genome Exploration Research Group in Riken
Genomic Sciences Center and Genome Science Laboratory in
RIKEN. Division of Experimental Animal Research in Riken
contributed to prepare mouse tissues. 1st strand cDNA was
primed with a primer [5'
GAGAGAGAGAAGGATCCAAGAGCTCTTTTTTTTTTTTTTTTVN 3'], cDNA was
prepared by using trehalose thermo-activated reverse
transcriptase and subsequently enriched for full-length by
cap-trapper. cDNA went through one round of normalization
to Rot = 20.0 and subtraction to Rot = 459.0. Second
strand cDNA was prepared with the primer adapter of
sequence [5' GAGAGAGAGATTCTCGAGTTAATTAAATTAATCCCCCCCCCCCCC
3']. cDNA was cleaved with XhoI and BamHI. Vector: a
modified pBluescript KS(+) after bulk excision from Lambda
FLC I."



ORIGIN 



Query Match 26.6%; Score 462.8; DB 13; Length 669;
Best Local Similarity 86.7%; Pred. No. 6.8e-115;
Matches 509; Conservative 0; Mismatches 78; Indels 0; Gaps 0;



Qy 1 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60
IN I I I I I I I II I I I I M I I I I I I I I I I I I I I I I I I I I I I II II II III 
Db 81 ATGTCTTTCCACGTAGAAGGACTGGTAGCTATTATCCTCTTCTACCTCCTTATATTTCTG 140

Qy 61 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 120
I I I I I I I II I II I I I M MM II II II I II I M I I I I I M I II I I II II I I Ml 
Db 141 GTTGGAATATGGGCTGCATGGAAAACCAAAAACAGCGGCAACCCAGAAGAGCGCAGTGAA 200

Qy 121 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 180
I I I I M I II I I II i I I I I M II M I I II I I I I I I I I I II I II I I I II I I I I M 
Db 201 GCCATCATAGTCGGGGGCCGTGACATTGGTTTGTTGGTTGGTGGTTTTACCATGACAGCC 260

Qy 181 ACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTAT 240
I II I II I I I II I I I I I II I I II M II Mill I I I II I M MM M I I M I I 
Db 261 ACCTGGGTTGGAGGAGGCTACATCAATGGGACAGCAGAAGCAGTGTATGGGCCAGGTTGT 320

Qy 241 GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 300
M II II M I M II II M II I II I II I I II I I M I I I II II I M I M II II I I III 
Db 321 GGTCTAGCTTGGGCTCAGGCACCCATTGGATATTCTCTGAGTCTAATTTTAGGTGGTCTG 380



Qy 301 TTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAA 360



II I II II I II I I M II I I I I I Mill I I M I I II I I I I I II I I I I I I I I I II 

Db 381 TTTTTTGCGAAACCTATGCGTTCCAAGGGATATGTGACTATGTTAGACCCATTTCAACAG 440 

Qy 361 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTC 420 

I I I I II M I II I I M I I I I II II II II II I I I I I I I I I I I I I I I I I I I I I I I 

Db 441 ATCTATGGAAAGCGCATGGGTGGGCTGCTCTTCATCCCTGCACTGATGGGAGAGATGTTC 500 

Qy 421 TGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGAT 480 

M I I M I I I I I I I I I I I I I I I M M II M I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 501 TGGGCTGCAGCAATTTTCTCTGCATTAGGGGCCACCATCAGCGTGATCATTGATGTGGAT 560 

Qy 481 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG 540 

M MM M I I II I II M I II M II I II II I I III II II M I II I I III 

Db 561 GTGAACATATCGGTCATTGTCTCTGCACTCATTGCCATTCTTTATACCCTAGTGGGTGGG 620

Qy 541 CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTT 587 

Mill I I II M M M II II II M II II II I II II II II I 

Db 621 CTCTACTCTGTGGCATATACTGATGTTGTCCAGCTATTCTGCATTNT 667 



RESULT 9 
BE233479 
LOCUS BE233479 516 bp mRNA linear EST 10-JUL-2000
DEFINITION 139685 MARC 1PIG Sus scrofa cDNA 5', mRNA sequence.
ACCESSION BE233479
VERSION BE233479.1 GI:9018197
KEYWORDS EST.
SOURCE Sus scrofa (pig)
ORGANISM Sus scrofa
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Cetartiodactyla; Suina; Suidae; Sus.
REFERENCE 1 (bases 1 to 516)
AUTHORS Fahrenkrug,S.C., Smith,T.P.L., Freking,B.A., Cho,J., White,J.,
Vallet,J., Wise,T., Rohrer,G.A., Pertea,G., Sultana,R.,
Quackenbush,J. and Keele,J.W.
TITLE Porcine gene discovery by normalized cDNA-library sequencing and
EST cluster assembly
JOURNAL Mamm. Genome 13 (8), 475-478 (2002)
MEDLINE 22213789
PUBMED 12226715
COMMENT Contact: Smith TPL
USDA, ARS, US Meat Animal Research Center
PO Box 166, Clay Center, NE 68933-0166, USA
Tel: 402 762 4366
Fax: 402 762 4390
Email: smith@email.marc.usda.gov
Single pass sequencing. Bases called and alt_trimmed with phred
v0.980904.e. Vector identified by cross_match with the -minscore 18
and -minmatch 12 options.
PCR PRimers
FORWARD: AGGAAACAGCTATGACCAT
BACKWARD: GTTTTCCCAGTCACGACG
Plate: 75 row: G column: 12
Seq primer: ATTTAGGTGACACTATAG.
FEATURES Location/Qualifiers
source 1..516
/organism="Sus scrofa"
/mol_type="mRNA"
/db_xref="taxon:9823"
/tissue_type="pooled"
/lab_host="DH10B"
/clone_lib="MARC 1PIG"
/note="Vector: pCMV SPORT6; Site_1: NotI; Site_2: SalI;
Library made from pooled tissue from day 11, 13, 15, 20,
and 30 embryos."

ORIGIN 

Query Match 23.2%; Score 404; DB 10; Length 516; 

Best Local Similarity 86.4%; Pred. No. 7.2e-99; 

Matches 446; Conservative 0; Mismatches 70; Indels 0; Gaps 0; 
Qy 1067 GTTCCATGTTTGCACGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAG 1126 

INI I I I I I I I I I I I I I I I I I I I I I I II M M I I I I I I I I I I I I I I I I I 

Db 1 GTTCTATGTTTGCTAGAAACATCTAGCAGCTCTCATTCAGACAAAACGCTTCCGACAGGG 60 

Qy 1127 AAATCGTTTGGGTTATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGG 1186 

I I I I II I I II I II II I I I I I I I I I II I I I I II M II I I I I I I I I I II I I I I I 

Db 61 AGATCGTCTGGGTCATGCGGATCACAGTATTCGTGTTTGGTGCGTCTGCAACAGCCATGG 120 

Qy 1187 CCTTGCTGACGAAAACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCG 1246 

II I I M I I I I II II I I I I I I I I I I I I I II I II I II I I II I II II M MINI 

Db 121 CCTTGCTGACCAAGACCGTGTATGGGCTCTGGTACCTCAGCTCCGACCTCGTCTACATCA 180 

Qy 1247 TTATCTTCCCCCAGCTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCG 1306 

I II I I II I M I I I II I II I I I II I I I II I I I I II I I I I I I II II I II I I I I 
Db 181 TTATCTTCCCGCAGCTGCTCTGTGTGCTCTTCATCAAGGGGACCAACACGTACGGGGCCG 240 

Qy 1307 TGGCAGGTTATGTTTCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATC 1366 

I I I I I I I I I II I I I II II I I I I II II I I II M I I I I I I M I I I II I I 
Db 241 TGGCAGGGTACATTGCTGGCCTTTTCCTGAGGGTAACCGGTGGAGAGCCATACCTGAACC 300 

Qy 1367 TTCAGCCCTTGATCTTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGA 1426 

I I I I I M I I I I I I I I I I I I I I M I I I I I I III I I M M I I II I I I I M I II { 
Db 301 TGCAGCCCTTGATCTTTTACCCTGGTTATTACGTTGAAAAAAATGGTATATATAATCAGA 360 

Qy 1427 AATTTCCATTTAAAACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCT 1486 

IN II I I I I I I I I I I I I I I I I I I I II M I I I I I I I I I I I I I I I I I I I M I I 

Db 361 GATTCCCATTTAAAACCCTTGCCATGCTCACCTCCTTCTTATCCAACATTTGCATCTCTT 420 

Qy 1487 ATCTAGCCAAGTATCTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATG 1546 

M I I I I I II I M I I I I I II I I I II I I I II II I I I I M I I I I I I I I I I I I I I I I I I I 
Db 421 ATCTAGCCAAATATCTATTTGAAAGTGGAACCTTGCCACCAAAATTAGATATGTTTGATG 480 

Qy 1547 CTGTTGTTGCAAGACACAGTGAAGAAAACATGGATA 1582 

I I II I II I I M I I I I I I I I I I I I I II I I I I I I I I II 

Db 481 CTGTTGTTGCAAGACACAGTGAAGAAAACATGGATA 516
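
The percentages printed with each alignment appear to follow from the counts on the same summary lines. The sketch below is a minimal illustration under two assumptions that the report itself does not state: that Best Local Similarity is the number of matches divided by the number of aligned columns (matches + mismatches + indels), and that Query Match is the alignment score divided by the perfect score of 1743 given in the run header. The function names are hypothetical and are not part of the search program.

```python
PERFECT_SCORE = 1743  # "Perfect score" from the run header of this report


def best_local_similarity(matches, mismatches, indels):
    """Assumed definition: matches as a percentage of aligned columns."""
    columns = matches + mismatches + indels
    return round(100.0 * matches / columns, 1)


def query_match(score, perfect_score=PERFECT_SCORE):
    """Assumed definition: score as a percentage of the perfect score."""
    return round(100.0 * score / perfect_score, 1)


# Counts copied from the RESULT 9 summary lines (BE233479).
print(best_local_similarity(446, 70, 0))  # 86.4
print(query_match(404))                   # 23.2
```

Under these assumptions the same two functions also reproduce the figures printed for RESULT 10 (86.0 and 18.9) and RESULT 14 (62.7 and 15.1).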



RESULT 10 
BB626260 

LOCUS BB626260 650 bp mRNA linear EST 26-OCT-2001
DEFINITION BB626260 RIKEN full-length enriched, adult male diencephalon Mus
musculus cDNA clone 9330170D24 5', mRNA sequence.
ACCESSION BB626260
VERSION BB626260.1 GI:16464298
KEYWORDS EST.
SOURCE Mus musculus (house mouse)
ORGANISM Mus musculus
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
REFERENCE 1 (bases 1 to 650)
AUTHORS Arakawa,T., Carninci,P., Fukuda,S., Furuno,M., Hanagaki,T.,
Hara,A., Hiramoto,K., Hori,F., Ishii,Y., Ito,M., Kawai,J.,
Konno,H., Kouda,M., Koya,S., Matsuyama,T., Miyazaki,A., Nomura,K.,
Ohno,M., Okazaki,Y., Okido,T., Saito,R., Sakai,C., Sakai,K.,
Sano,H., Sasaki,D., Shibata,K., Shinagawa,A., Shiraki,T.,
Sogabe,Y., Suzuki,H., Tagami,M., Tagawa,A., Takahashi,F.,
Takeda,Y., Tanaka,T., Toya,T., Muramatsu,M. and Hayashizaki,Y.
TITLE RIKEN Mouse ESTs (Arakawa,T., et al. 2001)
JOURNAL Unpublished (2001)
COMMENT Contact: Yoshihide Hayashizaki
Laboratory for Genome Exploration Research Group, RIKEN Genomic
Sciences Center (GSC), Yokohama Institute
The Institute of Physical and Chemical Research (RIKEN)
1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan
Tel: 81-45-503-9222
Fax: 81-45-503-9216
Email: genome-res@gsc.riken.go.jp,
URL: http://genome.gsc.riken.go.jp/

Carninci,P., Shibata,Y., Hayatsu,N., Sugahara,Y., Shibata,K.,
Itoh,M., Konno,H., Okazaki,Y., Muramatsu,M. and Hayashizaki,Y.
Normalization and subtraction of cap-trapper-selected cDNAs to
prepare full-length cDNA libraries for rapid discovery of new
genes. Genome Res. 10 (10), 1617-1630 (2000)

wagi,K., Fujiwake,S., Inoue,K., Togawa,Y., Izawa,M., Ohara,E.,
Watahiki,M., Yoneda,Y., Ishikawa,T., Ozawa,K., Tanaka,T.,
Matsuura,S., Kawai,J., Okazaki,Y., Muramatsu,M., Inoue,Y., Kira,A.
and Hayashizaki,Y.
RIKEN integrated sequence analysis (RISA) system--384-format
sequencing pipeline with 384 multicapillary sequencer. Genome Res.
10 (11), 1757-1771 (2000)

Konno,H., Fukunishi,Y., Shibata,K., Itoh,M., Carninci,P.,
Sugahara,Y. and Hayashizaki,Y.
Computer-based methods for the mouse full-length cDNA
encyclopedia: real-time sequence clustering for construction of a
nonredundant cDNA library. Genome Res. 11 (2), 281-289 (2001)

Kondo,S., Shinagawa,A., Saito,T., Kiyosawa,H., Yamanaka,I.,
Aizawa,K., Fukuda,S., Hara,A., Itoh,M., Kawai,J., Shibata,K. and
Hayashizaki,Y.
Computational Analysis of Full-Length Mouse cDNAs Compared with
Human Genome Sequences. Mamm. Genome. 12, 673-677 (2001)

Please visit our web site (http://genome.gsc.riken.go.jp) for
further details.
e mouse tissues.
FEATURES Location/Qualifiers
source 1..650
/organism="Mus musculus"
/mol_type="mRNA"
/strain="C57BL/6J"
/db_xref="taxon:10090"
/clone="9330170D24"
/sex="male"
/tissue_type="diencephalon"
/dev_stage="adult"
/lab_host="DH10B"
/clone_lib="RIKEN full-length enriched, adult male
diencephalon"
/note="Site_1: SalI; Site_2: BamHI; cDNA library was
prepared and sequenced in Mouse Genome Encyclopedia
Project of Genome Exploration Research Group in Riken
Genomic Sciences Center and Genome Science Laboratory in
RIKEN. Division of Experimental Animal Research in Riken
contributed to prepare mouse tissues. 1st strand cDNA was
primed with a primer [5'
GAGAGAGAGAAGGATCCAAGAGCTCTTTTTTTTTTTTTTTTVN 3'], cDNA was
prepared by using trehalose thermo-activated reverse
transcriptase and subsequently enriched for full-length by
cap-trapper. cDNA went through one round of normalization
to Rot = 10.0 and subtraction to Rot = 185.0. Second
strand cDNA was prepared with the primer adapter of
sequence [5' GAGAGAGAGATTCTCGAGTTAATTAAATTAATCCCCCCCCCCCCC
3']. cDNA was cloned into the XhoI and BamHI sites.
Vector: a modified pBluescript KS(+) after bulk excision
from Lambda FLC I. Cloning sites, 5' end: SalI; 3' end:
BamHI"



ORIGIN 



Query Match 18.9%; Score 329.8; DB 10; Length 650; 

Best Local Similarity 86.0%; Pred. No. 1.4e-78; 

Matches 375; Conservative 0; Mismatches 60; Indels 1; Gaps 1; 
Qy 1 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60 

Ml I I I I I I I II I I I I I II II I I M I I I III I I I I M I II II II II III 

Db 201 ATGTCTTTCCACGTAGAAGGACTGGTAGCTATTATCCTCTTCTACCTCCTTATATTTCTG 260 

Qy 61 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 120

II I M I I I I II II M I I I I I I I I I I I I II II II MM I II M I II I I I II I III 
Db 261 GTTGGAATATGGGCTGCATGGAAAACCAAAAACAGCGGCAACCCAGAAGAGCGCAGTGAA 320 

Qy 121 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 180 

M II II II M I M Mill II II I II M I I II I II II I II I II II M M II II I 

Db 321 GCCATCATAGTCGGGGGCCGTGACATTGGTTTGTTGGTTGGTGGTTTTACCATGACAGCC 380 

Qy 181 ACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTAT 240 

II II II II II II II II II M I M II I Mill 11 II M II II II II II II I I 
Db 381 ACCTGGGTTGGAGGAGGCTACATCAATGGGACAGCAGAAGCAGTGTATGGGCCAGGTTGT 440 

Qy 241 GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 300 

II I M II I II I I II I II I II I M II I II I II II I II I I I M I II M II II III 

Db 441 GGTCTAGCTTGNGCTCANGCACCCATTGGATATTCTCTGAGTCTAATTTTAGGTGGTCTG 500 

Qy 301 TTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAA 360 

M II I II M M II M I II II M I I M II M I II M II II M I Mill II 

Db 501 TTTTTTGC-GAACCTATGCNGTCCAAGGGATATGTGACTATGTTAGACCCATTTCAACAG 559

Qy 361 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTC 420 

M M II II M I II II II II II M M II II II II I I II II I II I I I MUM 

Db 560 ATCTATGGAAAGCGCATGGGTGGGCTGCTCTTCATCCCTGCCCTGATGGGAGAGATGTTC 619 



Qy 421 TGGGCTGCAGCAATTT 436 

I I I I I I I I I I I I I I I 
Db 620 TGGGCTGCAGCCATTT 635 



RESULT 11 

AW668962 
LOCUS AW668962 541 bp mRNA linear EST 25-APR-2001
DEFINITION 111664 MARC 1BOV Bos taurus cDNA 5', mRNA sequence.
ACCESSION AW668962
VERSION AW668962.1 GI:7525476
KEYWORDS EST.
SOURCE Bos taurus (cow)
ORGANISM Bos taurus
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Cetartiodactyla; Ruminantia; Pecora; Bovoidea;
Bovidae; Bovinae; Bos.
REFERENCE 1 (bases 1 to 541)
AUTHORS Smith,T.P.L., Grosse,W.M., Freking,B.A., Roberts,A.J., Stone,R.T.,
Casas,E., Wray,J.E., White,J., Cho,J., Fahrenkrug,S.C.,
Bennett,G.L., Heaton,M.P., Laegreid,W.W., Rohrer,G.A.,
Chitko-McKown,C.G., Pertea,G., Holt,I., Karamycheva,S., Liang,F.,
Quackenbush,J. and Keele,J.W.
TITLE Sequence evaluation of four pooled-tissue normalized bovine cDNA
libraries and construction of a gene index for cattle
JOURNAL Genome Res. 11 (4), 626-630 (2001)
MEDLINE 21180013
PUBMED 11282978
COMMENT Contact: Smith TPL
USDA, ARS, US Meat Animal Research Center
PO Box 166, Clay Center, NE 68933-0166, USA
Tel: 402 762 4366
Fax: 402 762 4390
Email: smith@email.marc.usda.gov
Single pass sequencing. Bases called and alt_trimmed with phred
v0.980904.e. Vector identified by cross_match with the -minscore 18
and -minmatch 12 options.
PCR PRimers
FORWARD: AGGAAACAGCTATGACCAT
BACKWARD: GTTTTCCCAGTCACGACG
Plate: 95 row: L column: 20
Seq primer: ATTTAGGTGACACTATAG.
FEATURES Location/Qualifiers
source 1..541
/organism="Bos taurus"
/mol_type="mRNA"
/db_xref="taxon:9913"
/tissue_type="pooled"
/lab_host="DH10B"
/clone_lib="MARC 1BOV"
/note="Vector: pCMV SPORT6; Site_1: NotI; Site_2: SalI;
Library made from pooled tissue from lymph node, ovary,
fat, hypothalamus, and pituitary."



ORIGIN 



Query Match 17.9%; Score 312.8; DB 10; Length 541;
Best Local Similarity 85.0%; Pred. No. 5.9e-74;
Matches 350; Conservative 0; Mismatches 62; Indels 0; Gaps 0;



Qy 889 TCAACAGACTGGAACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCA 948 

I I I I I I I I I I I I I I I I I M I I I I I I I I I I II I II I II I I I I I I I II I 

Db 130 TCAACAGCCTGGAACCAGACTGCATACGGGCCTCTTGCTCCCAGGGAGAAACAGGAGGCA 189 

Qy 949 GACATGATTTTACCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGT 1008 

II M II M II II I I I I I II II I I I I I I I I I I I I II M M II I II I I I I I I I 

Db 190 GACATGATCTTGCCGATTGTCCTCAAGTATCTCTGCCCCGTGTACATTTCTTACTTTGGT 249

Qy 1009 CTTGGTGCAGTTTCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGT 1068 

I I I I I II I I II M II I I II II I I I II I I II I I II I I I I I I I I I I I I I II I II I I I I 

Db 250 CTTGGAGCCGTTTCTGCTGCTGTCATGTCCTCAGCAGATTCTTCCATCTTGTCAGCAAGT 309 

Qy 1069 TCCATGTTTGCACGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAA 1128 

II II I I M I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 310 TCGATGTTTGCTCGCAACATCTACCAGCTTTCATTCAGACAAAATGCTTCTGACAAGGAG 369 

Qy 1129 ATCGTTTGGGTTATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCC 1188

II II I I I I I I II I I I II I I II I I I I II I II II M I I I I I I I I M II I I I 

Db 370 ATAGTCTGGGTCATGCGCATCACGGTATTTGTGTTTGGAGCTTCTGCGATGACCATGGCC 429 

Qy 1189 TTGCTGACGAAAACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTT 1248 

I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I I I I I I 
Db 430 TTGCTAACGAAGACGGTGTATGGGCTCTGGTACCTCAGCTCTGACCTGGTCTACATCATC 489 

Qy 1249 ATCTTCCCCCAGCTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATG 1300

I I I I I I I I Ml I I I I II II I I I I I I I I I I I I I I I I I I I MM 

Db 490 ATCTTCCCGCAGTTGCTCTGCGTGCTCTTCATCAAGGGTACCAACACGTATG 541



RESULT 12 

BY729567 

LOCUS BY729567 675 bp mRNA linear EST 17-DEC-2002
DEFINITION BY729567 RIKEN full-length enriched, 12 days embryo spinal cord Mus
musculus cDNA clone C530033E06 5', mRNA sequence.
ACCESSION BY729567
VERSION BY729567.1 GI:27142694
KEYWORDS EST.
SOURCE Mus musculus (house mouse)
ORGANISM Mus musculus
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
REFERENCE 1 (bases 1 to 675)
AUTHORS Okazaki,Y., Furuno,M., Kasukawa,T., Adachi,J., Bono,H., Kondo,S.,
Nikaido,I., Osato,N., Saito,R., Suzuki,H., Yamanaka,I.,
Kiyosawa,H., Yagi,K., Tomaru,Y., Hasegawa,Y., Nogami,A.,
Schonbach,C., Gojobori,T., Baldarelli,R., Hill,D.P., Bult,C.,
Hume,D.A., Quackenbush,J., Schriml,L.M., Kanapin,A., Matsuda,H.,
Batalov,S., Beisel,K.W., Blake,J.A., Bradt,D., Brusic,V.,
Chothia,C., Corbani,L.E., Cousins,S., Dalla,E., Dragani,T.A.,
Fletcher,C.F., Forrest,A., Frazer,K.S., Gaasterland,T.,
Gariboldi,M., Gissi,C., Godzik,A., Gough,J., Grimmond,S.,
Gustincich,S., Hirokawa,N., Jackson,I.J., Jarvis,E.D., Kanai,A.,
Kawaji,H., Kawasawa,Y., Kedzierski,R.M., King,B.L., Konagaya,A.,
Kurochkin,I.V., Lee,Y., Lenhard,B., Lyons,P.A., Maglott,D.R.,
Maltais,L., Marchionni,L., McKenzie,L., Miki,H., Nagashima,T.,
Numata,K., Okido,T., Pavan,W.J., Pertea,G., Pesole,G.,
Petrovsky,N., Pillai,R., Pontius,J.U., Qi,D., Ramachandran,S.,
Ravasi,T., Reed,J.C., Reed,D.J., Reid,J., Ring,B.Z., Ringwald,M.,
Sandelin,A., Schneider,C., Semple,C.A., Setou,M., Shimada,K.,
Sultana,R., Takenaka,Y., Taylor,M.S., Teasdale,R.D., Tomita,M.,
Verardo,R., Wagner,L., Wahlestedt,C., Wang,Y., Watanabe,Y.,
Wells,C., Wilming,L.G., Wynshaw-Boris,A., Yanagisawa,M., Yang,I.,
Yang,L., Yuan,Z., Zavolan,M., Zhu,Y., Zimmer,A., Carninci,P.,
Hayatsu,N., Hirozane-Kishikawa,T., Konno,H., Nakamura,M.,
Sakazume,N., Sato,K., Shiraki,T., Waki,K., Kawai,J., Aizawa,K.,
Arakawa,T., Fukuda,S., Hara,A., Hashizume,W., Imotani,K., Ishii,Y.,
Itoh,M., Kagawa,I., Miyazaki,A., Sakai,K., Sasaki,D., Shibata,K.,
Shinagawa,A., Yasunishi,A., Yoshino,M., Waterston,R., Lander,E.S.,
Rogers,J., Birney,E. and Hayashizaki,Y.

TITLE Analysis of the mouse transcriptome based on functional annotation
of 60,770 full-length cDNAs
JOURNAL Nature 420, 563-573 (2002)
MEDLINE 22354683
PUBMED 12466851
COMMENT Contact: Yoshihide Hayashizaki
Laboratory for Genome Exploration Research Group, RIKEN Genomic
Sciences Center (GSC), Yokohama Institute
The Institute of Physical and Chemical Research (RIKEN)
1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama, Kanagawa 230-0045, Japan
Tel: 81-45-503-9222
Fax: 81-45-503-9216
Email: genome-res@gsc.riken.go.jp,
URL: http://genome.gsc.riken.go.jp/

Adachi,J., Aizawa,K., Akimura,T., Arakawa,T., Carninci,P.,
Fukuda,S., Hashizume,W., Hayashida,K., Hirozane,T., Hori,F.,
Imotani,K., Ishii,Y., Itoh,M., Kagawa,I., Kawai,J., Kojima,Y.,
Kondo,S., Konno,H., Koya,S., Miyazaki,A., Murata,M., Nakamura,M.,
Nomura,K., Numazaki,R., Ohno,M., Ohsato,N., Saito,R., Sakazume,N.,
Sano,H., Sasaki,D., Sato,K., Shibata,K., Shiraki,T., Tagami,M.,
Takeda,Y., Waki,K., Watahiki,A., Muramatsu,M. and Hayashizaki,Y.
Direct Submission

Computational Analysis of Full-Length Mouse cDNAs Compared with
Human Genome Sequences. Mamm. Genome. 12, 673-677 (2001)

Normalization and subtraction of cap-trapper-selected cDNAs to 
prepare full-length cDNA libraries for rapid discovery of new 
genes. Genome Res. 10 (10), 1617-1630 (2000) 

RIKEN integrated sequence analysis (RISA) system — 384-format 
sequencing pipeline with 384 multicapillary sequencer. Genome Res. 
10 (11), 1757-1771 (2000) 

Computer-based methods for the mouse full-length cDNA 
encyclopedia: real-time sequence clustering for construction of a 
nonredundant cDNA library. Genome Res. 11 (2), 281-289 (2001) 

cDNA library was prepared and sequenced in Mouse Genome 
Encyclopedia Project of Genome Exploration Research Group in Riken 
Genomic Sciences Center and Genome Science Laboratory in RIKEN. 
Division of Experimental Animal Research in Riken contributed to 
prepare mouse tissues. 

Please visit our web site (http://genome.gsc.riken.go.jp) for 
further details. 
FEATURES Location/Qualifiers 
source 1..675



/organism="Mus musculus" 
/mol_type="mRNA" 
/db_xref="taxon: 10090" 
/clone="C530033E06" 
/tissue_type="spinal cord" 
/dev_stage="12 days embryo" 
/lab_host="DH10B" 

/clone_lib="RIKEN full-length enriched, 12 days embryo
spinal cord"
/note="Site_1: SalI; Site_2: BamHI; cDNA library was
prepared and sequenced in Mouse Genome Encyclopedia
Project of Genome Exploration Research Group in Riken
Genomic Sciences Center and Genome Science Laboratory in
RIKEN. Division of Experimental Animal Research in Riken
contributed to prepare mouse tissues. 1st strand cDNA was
primed with a primer [5'
GAGAGAGAGAAGGATCCAAGAGCTCTTTTTTTTTTTTTTTTVN 3'], cDNA was
prepared by using trehalose thermo-activated reverse
transcriptase and subsequently enriched for full-length by
cap-trapper. Second strand cDNA was prepared with the
primer adapter of sequence [5'
GAGAGAGAGATTCTCGAGTTAATTAAATTAATCCCCCCCCCCCCC 3']. cDNA
was cleaved with XhoI and BamHI. Vector: a modified
pBluescript KS(+) after bulk excision from Lambda FLC I."

ORIGIN 

Query Match 16.6%; Score 290; DB 13; Length 675; 

Best Local Similarity 86.5%; Pred. No. 1.1e-67;

Matches 320; Conservative 0; Mismatches 50; Indels 0; Gaps 0; 
Qy 889 TCAACAGACTGGAACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCA 948 

II I I I II II II M II I II II II II III I I I M II I I II I II I I I II II III 

Db 306 TCCACAGACTGGAACCAGACTGCCTACGGGTATCCAGATCCCAAGACTAAGGAGGAAGCA 365 

Qy 949 GACATGATTTTACCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGT 1008 

I II II I I I I I II M I II II M II II II I I I II I II I I II II II II I M II I 
Db 366 GACATGATTCTCCCGATCGTTCTGCAGTACCTCTGCCCTGTGTACATCTCCTTCTTTGGG 425 

Qy 1009 CTTGGTGCAGTTTCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGT 1068 

I I I I I II I I I I II I II II II I I M I I I I II I II II I I I II I I I I I II III 

Db 426 CTTGGTGCTGTTTCAGCTGCTGTCATGTCCTCAGCTGACTCGTCCATCCTGTCGGCGAGT 485

Qy 1069 TCCATGTTTGCACGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAA 1128 

II I I I I I I I I I I II I I I I I I I I II I II I I I I I I M I I I I II I II II Mill III 

Db 486 TCTATGTTTGCTCGGAATATCTACCAGCTTTCCTTCAGACAAAATGCATCAGACAAGGT^ 545 

Qy 1129 ATCGTTTGGGTTATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCC 1188 

M II I II I I III I I II II III I II I I I I I I I I I I I M I I I I I I I I I I I I I I 
Db 546 ATTGTGTGGGTCATGAGGATCACTGTGCTTGTGTTCGGAGCATCTGCAACAGCCATGGCT 605

Qy 1189 TTGCTGACGAAAACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTT 1248 

M M I I I I I I I I I I I I I I I II M II I II I I I M I II II I II II II I I I I I I I I I 
Db 606 TTGCTGACGAAGACTGTGTATGGGCTCTGGTACCTGAGCTCTGACCTTGTCTACATCATC 665 

Qy 1249 ATCTTCCCCC 1258 

I I I I I I M I 

Db 666 ATCTTCCCAC 675 



RESULT 13 
BE723927 

LOCUS BE723927 524 bp mRNA linear EST 25-APR-2001
DEFINITION 198406 MARC 4BOV Bos taurus cDNA 5', mRNA sequence.
ACCESSION BE723927
VERSION BE723927.1 GI:10125223
KEYWORDS EST.
SOURCE Bos taurus (cow)
ORGANISM Bos taurus
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Cetartiodactyla; Ruminantia; Pecora; Bovoidea;
Bovidae; Bovinae; Bos.
REFERENCE 1 (bases 1 to 524)
AUTHORS Smith,T.P.L., Grosse,W.M., Freking,B.A., Roberts,A.J., Stone,R.T.,
Casas,E., Wray,J.E., White,J., Cho,J., Fahrenkrug,S.C.,
Bennett,G.L., Heaton,M.P., Laegreid,W.W., Rohrer,G.A.,
Chitko-McKown,C.G., Pertea,G., Holt,I., Karamycheva,S., Liang,F.,
Quackenbush,J. and Keele,J.W.
TITLE Sequence evaluation of four pooled-tissue normalized bovine cDNA
libraries and construction of a gene index for cattle
JOURNAL Genome Res. 11 (4), 626-630 (2001)
MEDLINE 21180013
PUBMED 11282978
COMMENT Contact: Smith TPL
USDA, ARS, US Meat Animal Research Center
PO Box 166, Clay Center, NE 68933-0166, USA
Tel: 402 762 4366
Fax: 402 762 4390
Email: smith@email.marc.usda.gov
Single pass sequencing. Bases called and alt_trimmed with phred
v0.980904.e. Vector identified by cross_match with the -minscore 18
and -minmatch 12 options.
PCR PRimers
FORWARD: AGGAAACAGCTATGACCAT
BACKWARD: GTTTTCCCAGTCACGACG
Plate: 106 row: L column: 14
Seq primer: ATTTAGGTGACACTATAG.
FEATURES Location/Qualifiers
source 1..524
/organism="Bos taurus"
/mol_type="mRNA"
/db_xref="taxon:9913"
/tissue_type="pooled"
/lab_host="DH10B"
/clone_lib="MARC 4BOV"
/note="Vector: pCMV SPORT6; Site_1: NotI; Site_2: SalI;
Library made from pooled tissue from day 20 and day 40
embryos."



ORIGIN 



Query Match 15.7%; Score 274.2; DB 10; Length 524; 

Best Local Similarity 84.7%; Pred. No. 2e-63;

Matches 331; Conservative 0; Mismatches 58; Indels 2; Gaps 2; 



Qy 889 TCAACAGACTGGAACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCA 948



Db 130 TCAACATCCTGGAACCAGACTGCATACGGGCCTCTTGCTCCCAGGGAGAAACAGGAGGCA 189 

Qy 949 GACATGATTTTACCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGT 1008

I I I I I I I I II II I I I II II I II I I I I II I I 11 Mill I I I I M I I I I I I I I 

Db 190 GACATGATCTTGCCGATTGTCCTCAAGTATCTCTGCCCCGTGTACATTTCTTACTTTGGT 249

Qy 1009 CTTGGTGCAGTTTCTGCTGCTGTTATGTCATCAGCAGATTC-TTCCATCTTGTCAGCAAG 1067 

II I II II II II I I M II I I I I II I I I II I I M I I I I I I I II I I I I I II I I I I I II 

Db 250 CTTGGAGCCGTTTCTGCTGCTGTCATGTCCTCAGCAGATTCTTTCCATCTTGTCAGCAAG 309 

Qy 1068 TTCCATGTTTGCACGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGA 1127 

III I I I I I I I I II II I I I I I II I I I I I I M I I I I I I I I I I I I II I I I I I I I I II 

Db 310 TTCGATGTTTGCTCGCAACATCTACCAGCTTTCATTCAGACAAAATGCTTCTGACAAGGA 369 

Qy 1128 AATCGTTTGGGTTATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGC 1187 

II II I I I I I I II I I I I II I II I I I I I I II I I II II Mill I I I M I I I 
Db 370 GATAGTCTGGGTCATGCGCATCACGGTATTTGTGTTTGGAGCTTCTGCGATGACCATGGC 429 

Qy 1188 CTTGCTGACGAAAACTGTGTATGGGCTC-TGGTACCTCAGTTCTGACCTTGTTTACATCG 1246 

I I I I II Mill II I M I II I I II I I I II I I I M I II I M I I I I I II II I I I I 
Db 430 CTTGCTAACGAAGACGGTGTATGGGCTCTTGGTACCTCAGCTCTGACCTGGTCTACATCA 489

Qy 1247 TTATCTTCCCCCAGCTGCTTTGTGTACTCTT 1277 

I II II II II III II II II II II II I 
Db 490 TCATCTTCCCGCAGTTGCTCTGCGTGCTCTT 520 



RESULT 14 

AL669749 

LOCUS AL669749 800 bp mRNA linear EST 14-JAN-2002
DEFINITION AL669749 directional larval cDNA library Ciona intestinalis cDNA
clone 052ZB03 5', mRNA sequence.
ACCESSION AL669749
VERSION AL669749.1 GI:18143007
KEYWORDS EST.
SOURCE Ciona intestinalis
ORGANISM Ciona intestinalis
Eukaryota; Metazoa; Chordata; Urochordata; Ascidiacea; Enterogona;
Phlebobranchia; Cionidae; Ciona.
REFERENCE 1 (bases 1 to 800)
AUTHORS Genoscope.
TITLE Ciona intestinalis directional larval cDNA library
JOURNAL Unpublished (2002)
COMMENT Contact: Genoscope
Genoscope - Centre National de Sequencage
BP 191 91006 EVRY cedex - France
Email: seqref@genoscope.cns.fr, Web: www.genoscope.cns.fr
IMPORTANT: this sequence may contain errors. The Ciona intestinalis
library from which the clone was isolated may be contaminated with
cDNAs from bacteria or other Eukarya.
Directional larval cDNA library originate from Dr.M.Branno,
Stazione A.Dohrn, Naples, Italy, and was prepared in
pBluescript2SK+.
FEATURES Location/Qualifiers
source 1..800
/organism="Ciona intestinalis"
/mol_type="mRNA"
/db_xref="taxon:7719"
/clone="052ZB03"
/clone_lib="directional larval cDNA library"
/note="Vector: pBluescript2SK+"

ORIGIN 

Query Match 15.1%; Score 263.2; DB 9; Length 800; 

Best Local Similarity 62.7%; Pred. No. 2.3e-60; 

Matches 504; Conservative 0; Mismatches 286; Indels 14; Gaps 6;

Qy 250 TGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTGTTCTTTGC- 308

I I I I I I I I I I I M I I I I I I II III I I I I II I I I I I I I II 

Db 2 TGGACGCAAGCACCCATTGGATACGCTTGCGCGTTAATACTTGGCGGCTTATTCTTTGCG 61 

Qy 309 AAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAAATCTATGG 368 

II I I I I I I I I I II I II II I I II I I II II I I I I I I I I I I I 

Db 62 AAGTAAAATGCGAAGTGAGGGATATGTGACGATGTTGGATCCACTGCAGCGCAACT-TGG 120

Qy 369 AAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTCTGGGCTGC 428 

I I I I I I I I I I I I I I I I I I I I II I I M I I I I I I I I I I I I I 

Db 121 TCGAGTAATGGGAGCGTTTCTTTATATACCTGCACTTGCTGGAGAATTATTCTGGTCTGC 180 

Qy 429 AGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGATATGCACAT 488 

I I I II I I I I I I I I I I I II I I I I I I I I I I I I II I I I I I 
Db 181 AGCTATATTGGCCGCGTTGGGCGGTACCTTCA-TGTTATCATTGATCTTCATATAACTGC 239 

Qy 489 TTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGGCTCTATTC 548 

I I II I I I I N I I I I I I I II I I I I I M III I I I I M I I I I 

Db 240 AGCTGTAATAGTATCTGCATGCATTGCTGTTGTATACACCATGGCCGGTGGTCTTTACTC 299 

Qy 549 TGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGGATCAGCGT 608 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 300 GGTTGCTTATACAGATGTAGTTCAGTTGATTTGCATATTCATTGGACTGTGGTTGAGCAT 359 

Qy 609 CCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTGCATGCCAA 668 

I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I 

Db 360 TCCATTCGCGTTCACTCATCCTGCTGTATCAGACATCGCCACTACAGCTTACCACTCACC 419 

Qy 669 ATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGGCTTGATAG 728 

II I I I I I I I I I I II II I I I I I I 

Db 420 TAAC---------TGGCTTGGTACTTGGGATATTTCGACCACTGGTCTATGGATCGACTC 470 

Qy 729 TTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGGGTTCTCTC 788 

I I I I I I III I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II 
Db 471 TGCTCTGCTACTGTTATTTGGTGGAATACCGTGGCAAGTTTACTTTCAT^GAGTTTTATC 530 

Qy 789 TTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGGTGCCTGGT 848 

I I I I I I I I I I I I I I I I I I I I I I I II I I III 

Db 531 GNCNAAAAGNNCANGAAGCGCTCAGAAGCTTTCATTCATTGCTGCGTTCGGATGTTTGTT 590 

Qy 849 GATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGGAACCAGAC 908 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I II 
Db 591 CATGTCAATACCTTCGATATTNATCGGTGCAATTGCTGCATCTACAGATTGGGACGCAAC 650 

Qy 909 TGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTACCAATTGT 968 

I I I I I I I I I I III IN II I I I I I I I I I I I I I I I I 



Db 651 ATCGTACGGCCTCCCAAGTCCAGTTGANAAAGGCGACCAAGCCAATATTCTACCCATTGT 710 



Qy 969 TCTGCAGTATCTC-TGCCCTGTGTATATTTCTTTCTTT-GGTCTTGGTGCAGTTTCTGCT 1026 

M II II III I I I II I I I II I I I I I I II Mill II I I M M I I I 
Db 711 GCTTCAATACCTCACCCCCTGTAGCTGTATCATTCTTTGGGGCTTGGCGCTGTTTCTGCT 770 

Qy 1027 GCTGTTATGTCATCAGCAGATTCT 1050 

Mill II M I I II M II Ml 

Db 771 GCTGTNATGTCATCTGCCGACTCT 794 



RESULT 15 
BW274870 
LOCUS       BW274870                 549 bp    mRNA    linear   EST 11-NOV-2002 
DEFINITION  BW274870 Nori Satoh unpublished cDNA library, gastrula and neurula 
            Ciona intestinalis cDNA clone cign070cl2 5', mRNA sequence. 
ACCESSION   BW274870 
VERSION     BW274870.1  GI:24855481 
KEYWORDS    EST. 
SOURCE      Ciona intestinalis 
  ORGANISM  Ciona intestinalis 
            Eukaryota; Metazoa; Chordata; Urochordata; Ascidiacea; Enterogona; 
            Phlebobranchia; Cionidae; Ciona. 
REFERENCE   1 (bases 1 to 549) 
  AUTHORS   Satou,Y., Shin-i,T., Kohara,Y. and Satoh,N. 
  TITLE     Expressed genes in Ciona intestinalis (2002c) 
  JOURNAL   Unpublished (2002) 
COMMENT     Contact: Nori Satoh 
            Department of Zoology 
            Kyoto University 
            Sakyo-ku, Kyoto, Kyoto 606-8502, Japan 
            Tel: 81-75-753-4081 
            Fax: 81-75-705-1113 
            Email: satoh@ascidian.zool.kyoto-u.ac.jp. 
FEATURES             Location/Qualifiers 
     source          1..549 
                     /organism="Ciona intestinalis" 
                     /mol_type="mRNA" 
                     /db_xref="taxon:7719" 
                     /clone="cign070cl2" 
                     /tissue_type="whole body" 
                     /dev_stage="gastrula and neurula" 
                     /clone_lib="Nori Satoh unpublished cDNA library, gastrula 
                     and neurula" 
ORIGIN 



Query Match 13.0%; Score 225.8; DB 13; Length 549; 
Best Local Similarity 64.4%; Pred. No. 3.4e-50; 
Matches 354; Conservative 0; Mismatches 193; Indels 3; Gaps 1; 

Qy     11 ATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTGGTTGGAATAT 70 
          ||||    ||  |  |  |||| ||||| |||||| ||   ||| |   | | || || |
Db      2 ATGTTCCTGGTTTAGTGNCTATTATCGTCTTCTACGTTGCTATTCTAGCGATCGGTATTT 61 

Qy     71 GGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAAGCCATCATAG 130 
            || || ||||| |  | ||  |  || || | | | |||   |||||  | ||||| |
Db     62 ATGCAGCATGGAGGAAAAGAAGAACCGGAAGAGGAAACGAG---AGCGAGACAATCATGG 118 



Qy 131 TTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCTACCTGGGTCG 190 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 119 TCGGGGGAAGAGACATCGGACTCTTTGTTGGAAGCTTTACTATGACTGCTACGTGGGTAG 178 

Qy 191 GAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTATGGCCTAGCTT 250 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I MINI I I I I I I 

Db 179 GTGGTGGTTACATCAACGGCACAGCAGAAGTTGTATACACCCCGGGTTCCGGTCTACTGT 238 

Qy 251 GGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTGTTCTTTGCAA 310 

M I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 239 GGACACAAGCGCCATTTGGTTACGGCTGCAGCCTCATGCTTGGCGGGTTGTTTTTCGCTA 298 

Qy 311 AACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAAATCTATGGAA 370 

I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I 

Db 299 AGAAAATGCGGACTCAGGGTTACGTCACCATGCTGGATCCATTGCAACGTAAGCTTGGCA 358 

Qy 371 AACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTCTGGGCTGCAG 430 

I I I I I I I I I I I I III I I I I I I M I I I I II I I I I I I I I I I I I I 

Db 359 GGCGCATGGGGGGTCTGTTGTACTTACCAGCACTCTTGGGTGAAATATTCTGGTCAGCCG 418 

Qy 431 CAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGATATGCACATTT 490 

III I I I M I II I II I I I I I I I I I I I I I I I I M I III 
Db 419 CCATCCTTGCCGCTCTTGGCGGTACATTGTCCGTGATCATAGACCTTGATATTCGTATCT 478 

Qy 491 CTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGGCTCTATTCTG 550 

II II II I I I M I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 

Db 479 CTGTCATTGTATCTGCATGTATTGCTGTGTTGTATACGTTGGTTGGTGGTCTGTATTCGG 538 

Qy 551 TGGCCTACAC 560 

I I I I II II 
Db 539 TGGCTTATAC 548 
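
The per-hit summary figures above appear to follow simple ratios. The sketch below recomputes them for results 14 and 15 under two assumptions inferred from the reported numbers themselves, not from GenCore documentation: that "Query Match" is the score divided by the perfect score (1743), and that "Best Local Similarity" is matches divided by the number of aligned columns (matches + mismatches + indels). The function names are hypothetical.

```python
def query_match(score: float, perfect_score: float) -> float:
    """Percent of the perfect (self-match) score reached by a hit (assumed definition)."""
    return 100.0 * score / perfect_score


def best_local_similarity(matches: int, mismatches: int, indels: int) -> float:
    """Percent identical columns over all aligned columns (assumed definition)."""
    return 100.0 * matches / (matches + mismatches + indels)


PERFECT_SCORE = 1743  # "Perfect score" from the search header

# Result 14 (AL669749): Score 263.2; Matches 504; Mismatches 286; Indels 14
r14 = (query_match(263.2, PERFECT_SCORE), best_local_similarity(504, 286, 14))

# Result 15 (BW274870): Score 225.8; Matches 354; Mismatches 193; Indels 3
r15 = (query_match(225.8, PERFECT_SCORE), best_local_similarity(354, 193, 3))

for name, (qm, bls) in (("RESULT 14", r14), ("RESULT 15", r15)):
    print(f"{name}: Query Match {qm:.1f}%; Best Local Similarity {bls:.1f}%")
```

Rounded to one decimal place, both ratios agree with the percentages printed in the report for these two hits.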

Search completed: September 29, 2004, 02:03:10 
Job time : 6782 secs 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 

Run on:         September 28, 2004, 20:07:21 ; Search time 10873 Seconds 
                (without alignments) 
                6948.120 Million cell updates/sec 

Title:          US-10-069-541-5 
Perfect score:  1743 
Sequence:       1 atggctttccatgtggaagg ctgaagataatttacagtga 1743 

Scoring table:  IDENTITY_NUC 
                Gapop 10.0 , Gapext 1.0 

Searched:       3470272 seqs, 21671516995 residues 

Total number of hits satisfying chosen parameters: 6940544 

Minimum DB seq length: 0 
Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 
                 Maximum Match 100% 
                 Listing first 45 summaries 
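
The throughput figure in this header can be cross-checked from the other header values. The sketch below assumes that one "cell update" is one query-residue by database-residue comparison, and that both strands were searched (suggested by the hit count, 6940544, being exactly twice the 3470272 sequences searched). Neither assumption comes from GenCore documentation, and the function name is hypothetical.

```python
def cell_updates_per_second(query_len: int, db_residues: int,
                            seconds: float, strands: int = 2) -> float:
    """Dynamic-programming cells filled per second (assumed definition)."""
    return strands * query_len * db_residues / seconds


# Values taken from the search header above.
rate = cell_updates_per_second(query_len=1743,
                               db_residues=21_671_516_995,
                               seconds=10_873)
print(f"{rate / 1e6:.3f} Million cell updates/sec")

# The hit count is consistent with two strands per database sequence.
print(2 * 3_470_272)
```

Under these assumptions the computed rate reproduces the 6948.120 Million cell updates/sec shown in the header.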



Database :      GenEmbl:* 
                1:  gb_ba:* 
                2:  gb_htg:* 
                3:  gb_in:* 
                4:  gb_om:* 
                5:  gb_ov:* 
                6:  gb_pat:* 
                7:  gb_ph:* 
                8:  gb_pl:* 
                9:  gb_pr:* 
                10:  gb_ro:* 
                11:  gb_sts:* 
                12:  gb_sy:* 
                13:  gb_un:* 
                14:  gb_vi:* 
                15:  em_ba:* 
                16:  em_fun:* 
                17:  em_hum:* 
                18:  em_in:* 
                19:  em_mu:* 
                20:  em_om:* 
                21:  em_or:* 
                22:  em_ov:* 
                23:  em_pat:* 
                24:  em_ph:* 
                25:  em_pl:* 
                26:  em_ro:* 
                27:  em_sts:* 
                28:  em_un:* 
                29:  em_vi:* 
                30:  em_htg_hum:* 
                31:  em_htg_inv:* 
                32:  em_htg_other:* 
                33:  em_htg_mus:* 
                34:  em_htg_pln:* 
                35:  em_htg_rod:* 
                36:  em_htg_mam:* 
                37:  em_htg_vrt:* 
                38:  em_sy:* 
                39:  em_htgo_hum:* 
                40:  em_htgo_mus:* 
                41:  em_htgo_other:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

                         % 
Result                 Query 
   No.      Score      Match   Length  DB  ID          Description 

     1       1743      100.0     1743   6  E49871      E49871 High-affini 
     2       1743      100.0     1743   6  BD012719    BD012719 High-affi 
     3       1743      100.0     1743   9  AF276871    AF276871 Homo sapi 
     4       1743      100.0  [illegible]  9  HSA401466   AJ401466 Homo sapi 
     5       1743      100.0     5158   9  AB043997    AB043997 Homo sapi 
     6     1738.2       99.7     1743   6  AR268949    AR268949 Sequence 
     7     1394.2       80.0     1743   6  E49870      E49870 High-affini 
     8     1394.2       80.0     1743   6  BD012718    BD012718 High-affi 
     9     1394.2       80.0     4904  10  AB030947    AB030947 Rattus no 
    10       1375       78.9     1743  10  AF276872    AF276872 Mus muscu 
    11     1373.4       78.8     1743   6  E49872      E49872 High-affini 
    12     1373.4       78.8     1743   6  BD012720    BD012720 High-affi 
    13     1373.4       78.8     4938   6  AX080443    AX080443 Sequence 
    14       1367       78.4     1743  10  MMU401467   AJ401467 Mus muscu 
    15        867       49.7     2528   5  TMA420808   AJ420808 Torpedo m 
    16        730       41.9     1132   5  GGA511267   AJ511267 Gallus ga 
    17      630.8       36.2     2239   9  HSA308384   AJ308384 Homo sapi 
    18      630.8       36.2   190043   9  AC009963    AC009963 Homo sapi 
    19      502.8       28.8   232792   2  AC106657    AC106657 Rattus no 
    20      501.2       28.8   155131   2  AC102873    AC102873 Mus muscu 
    21      431.4       24.8     3326   3  AY011119    AY011119 Limulus p 
    22      405.8       23.3     3255   3  AY047521    AY047521 Drosophil 
    23      363.8       20.9     1731   6  E49869      E49869 High-affini 
    24      363.8       20.9     1731   6  BD012717    BD012717 High-affi 
    25      363.8       20.9     1985   3  AB030946    AB030946 Caenorhab 
    26      279.6       16.0      386   6  AX080449    AX080449 Sequence 
    27      242.6       13.9     1461   6  AX432086    AX432086 Sequence 
    28        226       13.0     1657   9  HSA308383   AJ308383 Homo sapi 
    29      179.6       10.3     1178   9  HSA308378   AJ308378 Homo sapi 
    30      179.6       10.3   186989   3  AC007812    AC007812 Drosophil 
    31      179.6       10.3   189117   3  AC009395    AC009395 Drosophil 
    32      179.6       10.3   255620   3  AE003723    AE003723 Drosophil 
    33      167.6        9.6   140156   2  AC017381    AC017381 Drosophil 
    34        163        9.4     2326   9  HSA308379   AJ308379 Homo sapi 
    35        155        8.9     1467   9  HSA308382   AJ308382 Homo sapi 
c   36      151.8        8.7    40893   3  CBRG45E19   AC084631 Caenorhab 
    37      150.2        8.6      736   9  HSA308381   AJ308381 Homo sapi 
    38        150        8.6     1308   9  HSA308380   AJ308380 Homo sapi 
c   39      141.6        8.1    39908   3  CEC48D1     Z81049 Caenorhabdi 
c   40      141.6        8.1   330724   2  CEY67H2     AL022475 Caenorhab 
c   41        132        7.6   152021   2  AC010923    AC010923 Drosophil 
    42         98        5.6      616  11  G84799      G84799 S208P6036FB 
c   43       51.8        3.0    10732   6  E32986      E32986 Gene encodi 
c   44         49        2.8     2000   6  AX655393    AX655393 Sequence 
    45       48.2        2.8     2781   1  BSU92466    U92466 Bacillus su 



ALIGNMENTS 



RESULT 1 
E49871 
LOCUS       E49871                  1743 bp    DNA     linear   PAT 27-AUG-2002 
DEFINITION  High-affinity choline transporter. 
ACCESSION   E49871 
VERSION     E49871.1  GI:22554902 
KEYWORDS    JP 2001136976-A/3. 
SOURCE      Homo sapiens (human) 
  ORGANISM  Homo sapiens 
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE   1 (bases 1 to 1743) 
  AUTHORS   Haga,T. and Okuda,T. 
  TITLE     High-affinity choline transporter 
  JOURNAL   Patent: JP 2001136976-A 3 22-MAY-2001; 
            SCIENCE & TECH AGENCY 
COMMENT     OS   Homo sapiens (human) 
            PN   JP 2001136976-A/3 
            PD   22-MAY-2001 
            PF   27-DEC-1999 JP 1999368991 
            PI   TATSUYA HAGA,TAKASHI OKUDA 
            PC   C12N15/09,A01K67/027,A61K38/00,C07K14/47,C07K16/18,C07K19/00, 
            PC   C12N5/10, 
            PC   C12P21/02,C12P21/08,C12Q1/00,C12N15/00,A61K37/02,C12N5/00 
            CC 
            FH   Key             Location/Qualifiers 
            FT   CDS             (1)..(1743). 
FEATURES             Location/Qualifiers 
     source          1..1743 
                     /organism="Homo sapiens" 
                     /mol_type="genomic DNA" 
                     /db_xref="taxon:9606" 
ORIGIN 



Query Match 100.0%; Score 1743; DB 6; Length 1743; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 1743; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy      1 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db      1 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60 

Qy     61 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 120 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     61 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 120 

Qy    121 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 180 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    121 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 180 

Qy    181 ACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTAT 240 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    181 ACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTAT 240 

Qy    241 GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 300 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    241 GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 300 

Qy    301 TTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAA 360 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    301 TTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAA 360 

Qy    361 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTC 420 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    361 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTC 420 

Qy    421 TGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGAT 480 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    421 TGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGAT 480 

Qy    481 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG 540 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    481 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG 540 

Qy    541 CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGG 600 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    541 CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGG 600 

Qy    601 ATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTG 660 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    601 ATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTG 660 

Qy    661 CATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGG 720 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    661 CATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGG 720 

Qy    721 CTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGG 780 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    721 CTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGG 780 

Qy    781 GTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGG 840 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    781 GTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGG 840 

Qy    841 TGCCTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGG 900 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    841 TGCCTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGG 900 

Qy    901 AACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTA 960 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    901 AACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTA 960 

Qy    961 CCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTT 1020 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    961 CCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTT 1020 

Qy   1021 TCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCA 1080 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1021 TCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCA 1080 

Qy   1081 CGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTT 1140 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1081 CGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTT 1140 

Qy   1141 ATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAA 1200 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1141 ATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAA 1200 

Qy   1201 ACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAG 1260 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1201 ACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAG 1260 

Qy   1261 CTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTT 1320 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1261 CTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTT 1320 

Qy   1321 TCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATC 1380 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1321 TCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATC 1380 

Qy   1381 TTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAATTTCCATTTAAA 1440 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1381 TTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAATTTCCATTTAAA 1440 

Qy   1441 ACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTAT 1500 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1441 ACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTAT 1500 

Qy   1501 CTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGA 1560 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1501 CTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGA 1560 

Qy   1561 CACAGTGAAGAAAACATGGATAAGACAATTCTTGTCAAAAATGAAAATATTAAATTAGAT 1620 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1561 CACAGTGAAGAAAACATGGATAAGACAATTCTTGTCAAAAATGAAAATATTAAATTAGAT 1620 

Qy   1621 GAACTTGCACTTGTGAAGCCACGACAGAGCATGACCCTCAGCTCAACTTTCACCAATAAA 1680 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1621 GAACTTGCACTTGTGAAGCCACGACAGAGCATGACCCTCAGCTCAACTTTCACCAATAAA 1680 

Qy   1681 GAGGCCTTCCTTGATGTTGATTCCAGTCCAGAAGGGTCTGGGACTGAAGATAATTTACAG 1740 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1681 GAGGCCTTCCTTGATGTTGATTCCAGTCCAGAAGGGTCTGGGACTGAAGATAATTTACAG 1740 

Qy   1741 TGA 1743 
          |||
Db   1741 TGA 1743 
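
Each alignment block in this report pairs a query row (Qy) with a database row (Db) and marks identical columns with "|" on the line between them. As a minimal sketch of how such a marker line can be derived from two already-aligned rows, assuming "-" is the gap character and that only identical non-gap letters are marked (the helper name `match_line` is hypothetical and is not part of GenCore):

```python
def match_line(qy: str, db: str, gap: str = "-") -> str:
    """Return '|' under identical, non-gap columns and ' ' elsewhere."""
    if len(qy) != len(db):
        raise ValueError("aligned rows must have the same number of columns")
    return "".join(
        "|" if a == b and a != gap else " "
        for a, b in zip(qy.upper(), db.upper())
    )


# Last block of RESULT 1 (query positions 1741-1743): identical rows.
print(match_line("TGA", "TGA"))

# A made-up pair of rows with one gap column, for illustration only.
print(repr(match_line("ATG-C", "ATGAC")))
```

For a 100% identical hit such as this one, every marker line is a full run of "|" characters.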



RESULT 2 
BD012719 
LOCUS       BD012719                1743 bp    DNA     linear   PAT 02-AUG-2002 
DEFINITION  High-affinity choline transporter. 
ACCESSION   BD012719 
VERSION     BD012719.1  GI:22092908 
KEYWORDS    WO 0116315-A/3. 
SOURCE      Homo sapiens (human) 
  ORGANISM  Homo sapiens 
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE   1 (bases 1 to 1743) 
  AUTHORS   Haga,T. and Okuda,T. 
  TITLE     High-affinity choline transporter 
  JOURNAL   Patent: WO 0116315-A 3 08-MAR-2001; 
            JAPAN SCIENCE AND TECHNOLOGY CORP,TATSUYA HAGA,TAKASHI OKUDA 
COMMENT     OS   Homo sapiens (human) 
            PN   WO 0116315-A/3 
            PD   08-MAR-2001 
            PF   18-AUG-2000 WO 2000JP005545 
            PR   27-AUG-1999 JP 99P 240642, 27-DEC-1999 JP 99P 368991 
            PI   TATSUYA HAGA,TAKASHI OKUDA 
            PC   C12N15/12,C07K14/47,C12Q1/68,C07K19/00,C07K16/18,C12N5/10, 
            PC   A61K38/17, 
            PC   A61K45/00,A61P25/28,G01N33/53,A01K67/027 
            CC 
            FH   Key             Location/Qualifiers 
            FT   CDS             (1)..(1743). 
FEATURES             Location/Qualifiers 
     source          1..1743 
                     /organism="Homo sapiens" 
                     /mol_type="genomic DNA" 
                     /db_xref="taxon:9606" 
ORIGIN 



Query Match 100.0%; Score 1743; DB 6; Length 1743; 
Best Local Similarity 100.0%; Pred. No. 0; 
Matches 1743; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy      1 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db      1 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60 

Qy     61 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 120 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     61 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 120 

Qy    121 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 180 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    121 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 180 

Qy    181 ACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTAT 240 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    181 ACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTAT 240 

Qy    241 GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 300 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    241 GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 300 

Qy    301 TTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAA 360 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    301 TTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAA 360 

Qy    361 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTC 420 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    361 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTC 420 

Qy    421 TGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGAT 480 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    421 TGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGAT 480 

Qy    481 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG 540 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    481 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG 540 

Qy    541 CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGG 600 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    541 CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGG 600 

Qy    601 ATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTG 660 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    601 ATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTG 660 

Qy    661 CATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGG 720 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    661 CATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGG 720 

Qy    721 CTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGG 780 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    721 CTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGG 780 

Qy    781 GTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGG 840 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    781 GTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGG 840 

Qy    841 TGCCTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGG 900 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    841 TGCCTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGG 900 

Qy    901 AACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTA 960 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    901 AACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTA 960 

Qy    961 CCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTT 1020 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    961 CCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTT 1020 

Qy   1021 TCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCA 1080 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1021 TCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCA 1080 

Qy   1081 CGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTT 1140 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1081 CGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTT 1140 

Qy   1141 ATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAA 1200 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1141 ATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAA 1200 

Qy   1201 ACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAG 1260 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1201 ACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAG 1260 

Qy   1261 CTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTT 1320 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1261 CTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTT 1320 

Qy   1321 TCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATC 1380 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1321 TCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATC 1380 

Qy   1381 TTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAATTTCCATTTAAA 1440 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1381 TTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAATTTCCATTTAAA 1440 

Qy   1441 ACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTAT 1500 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1441 ACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTAT 1500 

Qy   1501 CTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGA 1560 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1501 CTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGA 1560 

Qy   1561 CACAGTGAAGAAAACATGGATAAGACAATTCTTGTCAAAAATGAAAATATTAAATTAGAT 1620 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1561 CACAGTGAAGAAAACATGGATAAGACAATTCTTGTCAAAAATGAAAATATTAAATTAGAT 1620 

Qy   1621 GAACTTGCACTTGTGAAGCCACGACAGAGCATGACCCTCAGCTCAACTTTCACCAATAAA 1680 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1621 GAACTTGCACTTGTGAAGCCACGACAGAGCATGACCCTCAGCTCAACTTTCACCAATAAA 1680 

Qy   1681 GAGGCCTTCCTTGATGTTGATTCCAGTCCAGAAGGGTCTGGGACTGAAGATAATTTACAG 1740 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1681 GAGGCCTTCCTTGATGTTGATTCCAGTCCAGAAGGGTCTGGGACTGAAGATAATTTACAG 1740 

Qy   1741 TGA 1743 
          |||
Db   1741 TGA 1743 



RESULT 3 
AF276871 
LOCUS       AF276871                1743 bp    mRNA    linear   PRI 27-NOV-2000 
DEFINITION  Homo sapiens high affinity choline transporter (SLC5A7) mRNA, 
            complete cds. 
ACCESSION   AF276871 
VERSION     AF276871.1  GI:10998441 
KEYWORDS    . 
SOURCE      Homo sapiens (human) 
  ORGANISM  Homo sapiens 
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE   1 (bases 1 to 1743) 
  AUTHORS   Apparsundaram,S., Ferguson,S.M., George,A.L. Jr. and Blakely,R.D. 
  TITLE     Molecular cloning of a human, hemicholinium-3-sensitive choline 
            transporter 
  JOURNAL   Biochem. Biophys. Res. Commun. 276 (3), 862-867 (2000) 
  MEDLINE   20483599 
   PUBMED   11027560 
REFERENCE   2 (bases 1 to 1743) 
  AUTHORS   Apparsundaram,S., Ferguson,S.M. and Blakely,R.D. 
  TITLE     Direct Submission 
  JOURNAL   Submitted (09-JUN-2000) Department of Pharmacology and Center for 
            Molecular Neuroscience, Vanderbilt University, 23rd Avenue South at 
            Pierce, Nashville, TN 37232-6420, USA 
FEATURES             Location/Qualifiers 
     source          1..1743 
                     /organism="Homo sapiens" 
                     /mol_type="mRNA" 
                     /db_xref="taxon:9606" 
                     /chromosome="2" 
                     /map="2q12" 
     gene            1..1743 
                     /gene="SLC5A7" 
     CDS             1..1743 
                     /gene="SLC5A7" 
                     /note="hCHT; solute carrier family 5 member 7" 
                     /codon_start=1 
                     /product="high affinity choline transporter" 
                     /protein_id="AAG25940.1" 
                     /db_xref="GI:10998442" 
                     /translation="MAFHVEGLIAIIVFYLLILLVGIWAAWRTKNSGSAEERSEAIIV 
                     GGRDIGLLVGGFTMTATWVGGGYINGTAEAVYVPGYGLAWAQAPIGYSLSLILGGLFF 
                     AKPMRSKGYVTMLDPFQQIYGKRMGGLLFIPALMGEMFWAAAIFSALGATISVIIDVD 
                     MHISVIISALIATLYTLVGGLYSVAYTDVVQLFCIFVGLWISVPFALSHPAVADIGFT 
                     AVHAKYQKPWLGTVDSSEVYSWLDSFLLLMLGGIPWQAYFQRVLSSSSATYAQVLSFL 
                     AAFGCLVMAIPAILIGAIGASTDWNQTAYGLPDPKTTEEADMILPIVLQYLCPVYISF 
                     FGLGAVSAAVMSSADSSILSASSMFARNIYQLSFRQNASDKEIVWVMRITVFVFGASA 
                     TAMALLTKTVYGLWYLSSDLVYIVIFPQLLCVLFVKGTNTYGAVAGYVSGLFLRITGG 
                     EPYLYLQPLIFYPGYYPDDNGIYNQKFPFKTLAMVTSFLTNICISYLAKYLFESGTLP 
                     PKLDVFDAVVARHSEENMDKTILVKNENIKLDELALVKPRQSMTLSSTFTNKEAFLDV 
                     DSSPEGSGTEDNLQ" 
ORIGIN 
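
The /translation qualifier of this record can be related back to the 1743-base query: the CDS spans bases 1..1743 with codon_start=1. As a minimal sketch, assuming the standard genetic code (NCBI translation table 1) and using a hypothetical helper `translate`, the first and last codons of the query reproduce the start ("MAFHVEGL") and end ("...LQ" followed by a stop codon) of the listed protein:

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), codons in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate complete codons in frame 1; unknown codons become 'X'."""
    dna = dna.upper()
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X")
        for i in range(0, len(dna) - len(dna) % 3, 3)
    )


# First 24 bases of the query (positions 1-24 of the alignment).
print(translate("ATGGCTTTCCATGTGGAAGGACTG"))

# Last 9 bases of the query (positions 1735-1743), ending in the TGA stop codon.
print(translate("TTACAGTGA"))
```

The same helper can be applied to the full query sequence to compare against the whole /translation string.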



Query Match 100.0%; Score 1743; DB 9; Length 1743; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 1743; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy      1 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db      1 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60 

Qy     61 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 120 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     61 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 120 

Qy    121 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 180 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    121 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 180 

Qy    181 ACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTAT 240 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    181 ACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTAT 240 

Qy    241 GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 300 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    241 GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 300 

Qy    301 TTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAA 360 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    301 TTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAA 360 

Qy    361 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTC 420 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    361 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTC 420 

Qy    421 TGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGAT 480 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    421 TGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGAT 480 

Qy    481 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG 540 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    481 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG 540 

Qy    541 CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGG 600 
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    541 CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGG 600 


Qy 


601 


ATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTG 

N 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

ATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTG 


660 


Db 


601 


660 


Qy 


661 


CATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGG 

1 1 N 1 1 1 1 1 N 1 1 1 1 1 M M 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 

CATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGG 


720 


Db 


661 


720 


Qy 


721 


CTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGG 

1 N 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 M 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGG 


780 


Db 


721 


780 


Qy 


781 


GTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGG 

1 1 1 1 1 1 N M M 1 M 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 M 1 1 1 1 II 1 1 1 1 1 1 1 M 1 1 1 1 1 1 M 1 

GTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGG 


840 


Db 


781 


840 


Qy 


841 


TGCCTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGG 

1 i M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t 1 1 1 1 1 i 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 t 1 1 1 1 I 1 1 
1 1 ■ 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M M 1 1 1 1 1 1 { 1 1 1 1 1 { 1 1 1 j 1 1 1 1 1 1 1 1 1 } 1 1 j 1 1 1 1 { 1 1 

TGCCTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGG 


900 


Db 


841 


900 


Qy 


901 


AACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTA 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

AACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTA 


960 


Db 


901 


960 



Qy 


961 


Db 


961 


Qy 


1021 


Db 


1021 


Qy 


1081 


Db 


1081 


Qy 


1141 


Db 


1141 


Qy 


1201 


Db 


1201 


Qy 


1261 


Db 


1261 


Qy 


1321 


Db 


1321 


Qy 


1381 


Db 


1381 


Qy 


1441 


Db 


1441 


Qy 


1501 


Db 


1501 


Qy 


1561 


Db 


1561 


Qy 


1621 


Db 


1621 


Qy 


1681 


Db 


1681 


Qy 


1741 


Db 


1741 



CCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTT 102 0 

I I I M I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I M M I I 

CCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTT 102 0 

TCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCA 108 0 
I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I 
TCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCA 1080 

CGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTT 114 0 
I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I 
CGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTT 114 0 

ATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAA 1200 

I I I I I M I M I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

ATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAA 1200 

ACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAG 1260 

I M I I I I M I M I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

ACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAG 12 60 

CTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTT 1320 
I I I I I I I I M I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M i I I I I 
CTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTT 1320 

TCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATC 1380 
I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
TCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATC 1380 

TTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAATTTCCATTTAAA 144 0 
I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
TTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAG7\AATTTCCATTTAAA 1440 

ACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTAT 1500 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I M I I I I I I I I I I I 
ACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTAT 1500 

CTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGA 1560 
I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I 
CTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGA 1560 

CACAGTGAAGAAAACATGGATAAGACAATTCTTGTCAAAAATGAAAATATTAAATTAGAT 1620 

I M I I I I M I I I I M I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I M I I I 

CACAGTGAAGAAAACATGGATAAGACAATTCTTGTCAAAAATGAAAATATTAAATTAGAT 162 0 
GAACTTGCACTTGTGAAGCCACGACAGAGCATGACCCTCAGCTCAACTTTCACCAATAAA 168 0 

I I I I I I I M I I I I M I I I I I I I I I I I I I I I I I I i I I I I I M I I I I I M I I I I I I I M I I I 

GAACTTGCACTTGTGAAGCCACGACAGAGCATGACCCTCAGCTCAACTTTCACCAATAAA 1680 
GAGGCCTTCCTTGATGTTGATTCCAGTCCAGAAGGGTCTGGGACTGAAGATAATTTACAG 174 0 

I I I I I I I I I I I I I I I I M I M I I I I I I I I I I I I I I M I I I I I I I I I I I I M I I I I I I I M 

GAGGCCTTCCTTGATGTTGATTCCAGTCCAGAAGGGTCTGGGACTGAAGATAATTTACAG 174 0 
TGA 1743 



RESULT 4 

HSA401466 

LOCUS       HSA401466 1813 bp mRNA linear PRI 16-AUG-2000
DEFINITION  Homo sapiens mRNA for high affinity choline transporter (CHT1
            gene).
ACCESSION   AJ401466
VERSION     AJ401466.1 GI:9843753
KEYWORDS    CHT1 gene; high affinity choline transporter.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1
  AUTHORS   Wieland,A., Bonisch,H. and Bruss,M.
  TITLE     Molecular cloning of the human and murine high affinity choline
            transporters and characterization of the human gene-structure
  JOURNAL   Unpublished
REFERENCE   2 (bases 1 to 1813)
  AUTHORS   Bruess,M.
  TITLE     Direct Submission
  JOURNAL   Submitted (14-AUG-2000) Bruess M., University of Bonn, Pharmacology
            and Toxicology, Reuterstr. 2b, D-53113 Bonn, GERMANY
FEATURES             Location/Qualifiers
     source          1..1813
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /chromosome="2"
                     /map="2q11-13"
                     /tissue_type="hypothalamus"
     gene            1..1813
                     /gene="CHT1"
     CDS             19..1761
                     /gene="CHT1"
                     /function="sodium- and chloride-dependent reuptake of
                     choline"
                     /codon_start=1
                     /evidence=experimental
                     /product="high affinity choline transporter"
                     /protein_id="CAC03717.1"
                     /db_xref="GI:9843754"
                     /db_xref="GOA:Q9GZV3"
                     /db_xref="SPTREMBL:Q9GZV3"
                     /translation="MAFHVEGLIAIIVFYLLILLVGIWAAWRTKNSGSAEERSEAIIV
                     GGRDIGLLVGGFTMTATWVGGGYINGTAEAVYVPGYGLAWAQAPIGYSLSLILGGLFF
                     AKPMRSKGYVTMLDPFQQIYGKRMGGLLFIPALMGEMFWAAAIFSALGATISVIIDVD
                     MHISVIISALIATLYTLVGGLYSVAYTDVVQLFCIFVGLWISVPFALSHPAVADIGFT
                     AVHAKYQKPWLGTVDSSEVYSWLDSFLLLMLGGIPWQAYFQRVLSSSSATYAQVLSFL
                     AAFGCLVMAIPAILIGAIGASTDWNQTAYGLPDPKTTEEADMILPIVLQYLCPVYISF
                     FGLGAVSAAVMSSADSSILSASSMFARNIYQLSFRQNASDKEIVWVMRITVFVFGASA
                     TAMALLTKTVYGLWYLSSDLVYIVIFPQLLCVLFVKGTNTYGAVAGYVSGLFLRITGG
                     EPYLYLQPLIFYPGYYPDDNGIYNQKFPFKTLAMVTSFLTNICISYLAKYLFESGTLP
                     PKLDVFDAVVARHSEENMDKTILVKNENIKLDELALVKPRQSMTLSSTFTNKEAFLDV
                     DSSPEGSGTEDNLQ"



ORIGIN 



Query Match 100.0%; Score 1743; DB 9; Length 1813;

Best Local Similarity 100.0%; Pred. No. 0;

Matches 1743; Conservative 0; Mismatches 0; Indels 0; Gaps 0;



Qy          1 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         19 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 78

Qy         61 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db         79 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 138

Qy        121 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        139 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 198

Qy        181 ACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTAT 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        199 ACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTAT 258

Qy        241 GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        259 GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 318

Qy        301 TTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAA 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        319 TTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAA 378

Qy        361 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTC 420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        379 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTC 438

Qy        421 TGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGAT 480
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        439 TGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGAT 498

Qy        481 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG 540
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        499 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG 558

Qy        541 CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGG 600
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        559 CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGG 618

Qy        601 ATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTG 660
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        619 ATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTG 678

Qy        661 CATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGG 720
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        679 CATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGG 738

Qy        721 CTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGG 780
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        739 CTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGG 798

Qy        781 GTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGG 840
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        799 GTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGG 858

Qy        841 TGCCTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGG 900
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        859 TGCCTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGG 918

Qy        901 AACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTA 960
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        919 AACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTA 978

Qy        961 CCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTT 1020
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        979 CCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTT 1038

Qy       1021 TCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCA 1080
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1039 TCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCA 1098

Qy       1081 CGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTT 1140
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1099 CGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTT 1158

Qy       1141 ATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAA 1200
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1159 ATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAA 1218

Qy       1201 ACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAG 1260
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1219 ACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAG 1278

Qy       1261 CTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTT 1320
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1279 CTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTT 1338

Qy       1321 TCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATC 1380
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1339 TCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATC 1398

Qy       1381 TTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAATTTCCATTTAAA 1440
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1399 TTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAATTTCCATTTAAA 1458

Qy       1441 ACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTAT 1500
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1459 ACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTAT 1518

Qy       1501 CTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGA 1560
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1519 CTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGA 1578

Qy       1561 CACAGTGAAGAAAACATGGATAAGACAATTCTTGTCAAAAATGAAAATATTAAATTAGAT 1620
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1579 CACAGTGAAGAAAACATGGATAAGACAATTCTTGTCAAAAATGAAAATATTAAATTAGAT 1638

Qy       1621 GAACTTGCACTTGTGAAGCCACGACAGAGCATGACCCTCAGCTCAACTTTCACCAATAAA 1680
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1639 GAACTTGCACTTGTGAAGCCACGACAGAGCATGACCCTCAGCTCAACTTTCACCAATAAA 1698

Qy       1681 GAGGCCTTCCTTGATGTTGATTCCAGTCCAGAAGGGTCTGGGACTGAAGATAATTTACAG 1740
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1699 GAGGCCTTCCTTGATGTTGATTCCAGTCCAGAAGGGTCTGGGACTGAAGATAATTTACAG 1758

Qy       1741 TGA 1743
              |||
Db       1759 TGA 1761
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AB043997 5158 bp mRNA linear PRI 19-NOV-2000 

DEFINITION  Homo sapiens mRNA for high-affinity choline transporter CHT1,
            complete cds.
ACCESSION   AB043997
VERSION     AB043997.1 GI:11231080
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1 (sites)
  AUTHORS   Okuda,T. and Haga,T.
  TITLE     Functional characterization of the human high-affinity choline
            transporter
  JOURNAL   FEBS Lett. 484 (2), 92-97 (2000)
  MEDLINE   20521663
   PUBMED   11068039
REFERENCE   2 (bases 1 to 5158)
  AUTHORS   Okuda,T.
  TITLE     Direct Submission
  JOURNAL   Submitted (30-MAY-2000) Takashi Okuda, University of Tokyo, Faculty
            of Medicine, Department of Neurochemistry; 7-3-1 Hongo, Bunkyo-ku,
            Tokyo 1130033, Japan (E-mail:okuda@m.u-tokyo.ac.jp,
            URL:http://park.ecc.u-tokyo.ac.jp/neurochemistry,
            Tel:81-3-5841-3560, Fax:81-3-6814-8154)
FEATURES             Location/Qualifiers
     source          1..5158
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /tissue_type="spinal cord"
     CDS             277..2019
                     /codon_start=1
                     /product="high-affinity choline transporter CHT1"
                     /protein_id="BAB18161.1"
                     /db_xref="GI:11231081"
                     /translation="MAFHVEGLIAIIVFYLLILLVGIWAAWRTKNSGSAEERSEAIIV
                     GGRDIGLLVGGFTMTATWVGGGYINGTAEAVYVPGYGLAWAQAPIGYSLSLILGGLFF
                     AKPMRSKGYVTMLDPFQQIYGKRMGGLLFIPALMGEMFWAAAIFSALGATISVIIDVD
                     MHISVIISALIATLYTLVGGLYSVAYTDVVQLFCIFVGLWISVPFALSHPAVADIGFT
                     AVHAKYQKPWLGTVDSSEVYSWLDSFLLLMLGGIPWQAYFQRVLSSSSATYAQVLSFL
                     AAFGCLVMAIPAILIGAIGASTDWNQTAYGLPDPKTTEEADMILPIVLQYLCPVYISF
                     FGLGAVSAAVMSSADSSILSASSMFARNIYQLSFRQNASDKEIVWVMRITVFVFGASA
                     TAMALLTKTVYGLWYLSSDLVYIVIFPQLLCVLFVKGTNTYGAVAGYVSGLFLRITGG
                     EPYLYLQPLIFYPGYYPDDNGIYNQKFPFKTLAMVTSFLTNICISYLAKYLFESGTLP
                     PKLDVFDAVVARHSEENMDKTILVKNENIKLDELALVKPRQSMTLSSTFTNKEAFLDV
                     DSSPEGSGTEDNLQ"
ORIGIN



Query Match 100.0%; Score 1743; DB 9; Length 5158; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 1743; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy          1 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        277 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 336

Qy         61 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 120
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        337 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 396

Qy        121 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 180
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        397 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 456

Qy        181 ACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTAT 240
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        457 ACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTAT 516

Qy        241 GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 300
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        517 GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 576

Qy        301 TTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAA 360
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        577 TTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAA 636

Qy        361 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTC 420
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        637 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTC 696

Qy        421 TGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGAT 480
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        697 TGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGAT 756

Qy        481 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG 540
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        757 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG 816

Qy        541 CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGG 600
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        817 CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGG 876

Qy        601 ATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTG 660
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        877 ATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTG 936

Qy        661 CATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGG 720
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        937 CATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGG 996

Qy        721 CTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGG 780
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        997 CTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGG 1056

Qy        781 GTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGG 840
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1057 GTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGG 1116

Qy        841 TGCCTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGG 900
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1117 TGCCTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGG 1176

Qy        901 AACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTA 960
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1177 AACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTA 1236

Qy        961 CCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTT 1020
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1237 CCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTT 1296

Qy       1021 TCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCA 1080
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1297 TCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCA 1356

Qy       1081 CGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTT 1140
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1357 CGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTT 1416

Qy       1141 ATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAA 1200
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1417 ATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAA 1476

Qy       1201 ACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAG 1260
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1477 ACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAG 1536

Qy       1261 CTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTT 1320
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1537 CTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTT 1596

Qy       1321 TCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATC 1380
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1597 TCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATC 1656

Qy       1381 TTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAATTTCCATTTAAA 1440
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1657 TTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAATTTCCATTTAAA 1716

Qy       1441 ACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTAT 1500
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1717 ACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTAT 1776

Qy       1501 CTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGA 1560
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1777 CTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGA 1836

Qy       1561 CACAGTGAAGAAAACATGGATAAGACAATTCTTGTCAAAAATGAAAATATTAAATTAGAT 1620
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1837 CACAGTGAAGAAAACATGGATAAGACAATTCTTGTCAAAAATGAAAATATTAAATTAGAT 1896

Qy       1621 GAACTTGCACTTGTGAAGCCACGACAGAGCATGACCCTCAGCTCAACTTTCACCAATAAA 1680
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1897 GAACTTGCACTTGTGAAGCCACGACAGAGCATGACCCTCAGCTCAACTTTCACCAATAAA 1956

Qy       1681 GAGGCCTTCCTTGATGTTGATTCCAGTCCAGAAGGGTCTGGGACTGAAGATAATTTACAG 1740
              ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       1957 GAGGCCTTCCTTGATGTTGATTCCAGTCCAGAAGGGTCTGGGACTGAAGATAATTTACAG 2016

Qy       1741 TGA 1743
              |||
Db       2017 TGA 2019



RESULT 6 
AR268949 
LOCUS       AR268949 1743 bp DNA linear PAT 10-APR-2003
DEFINITION  Sequence 1 from patent US 6500643.
ACCESSION   AR268949
VERSION     AR268949.1 GI:29699686
KEYWORDS    .
SOURCE      Unknown.
  ORGANISM  Unknown.
            Unclassified.
REFERENCE   1 (bases 1 to 1743)
  AUTHORS   Wu,D.-H., Gu,Y., Millard,W.J. and He,Y.-J.
  TITLE     Human high affinity choline transporter
  JOURNAL   Patent: US 6500643-A 1 31-DEC-2002;
FEATURES             Location/Qualifiers
     source          1..1743
                     /organism="unknown"
                     /mol_type="genomic DNA"
ORIGIN

Query Match 99.7%; Score 1738.2; DB 6; Length 1743;

Best Local Similarity 99.8%; Pred. No. 0;

Matches 1740; Conservative 0; Mismatches 3; Indels 0; Gaps 0;



Qy 

Db 



ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60 

I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I M I M I I I I I I 

ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60 



Qy 

Db 



61 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 120 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I I I I I I I I I 

61 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 120 



Qy 

Db 

Qy 

Db 



121 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 180 

I I I I M I I I M I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I 

121 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 180 

181 ACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTAT 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I M I I 
181 ACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTAT 240 



Qy 



241 GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 300 



I I I M I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I M 

Db 241 GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 300 

Qy 301 TTCTTTGGAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAA 360 

I I I I I I M I I I I I I I I I I I I I I I I I I I I M M M I I I I I I I I I I I I I M I I I I I I I I I I I 

Db 301 TTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAA 360 

Qy 361 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTC 420 

I I I I I I I I I I I I I I M I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTC 420 

Qy 421 TGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGAT 480 

       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 421 TGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGAT 480

Qy 481 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG 540
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 481 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG 540

Qy 541 CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGG 600
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 541 CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGG 600

Qy 601 ATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTG 660
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 601 ATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTG 660

Qy 661 CATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGG 720
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 661 CATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGG 720

Qy 721 CTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGG 780
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 721 CTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGG 780

Qy 781 GTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGG 840
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 781 GTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGG 840

Qy 841 TGCCTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGG 900
       ||||||||||||||||||||||||||||||||||||||||||||||| || |||||||||
Db 841 TGCCTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCCTCCACAGACTGG 900

Qy 901 AACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTA 960
       ||| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 901 AACGAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTA 960

Qy 961 CCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTT 1020
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 961 CCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTT 1020

Qy 1021 TCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCA 1080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1021 TCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCA 1080



Qy 1081 CGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTT 1140
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1081 CGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTT 1140

Qy 1141 ATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAA 1200
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1141 ATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAA 1200

Qy 1201 ACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAG 1260
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1201 ACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAG 1260

Qy 1261 CTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTT 1320
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1261 CTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTT 1320

Qy 1321 TCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATC 1380
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1321 TCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATC 1380

Qy 1381 TTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAATTTCCATTTAAA 1440
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1381 TTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAATTTCCATTTAAA 1440

Qy 1441 ACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTAT 1500
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1441 ACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTAT 1500

Qy 1501 CTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGA 1560
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1501 CTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGA 1560

Qy 1561 CACAGTGAAGAAAACATGGATAAGACAATTCTTGTCAAAAATGAAAATATTAAATTAGAT 1620
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1561 CACAGTGAAGAAAACATGGATAAGACAATTCTTGTCAAAAATGAAAATATTAAATTAGAT 1620

Qy 1621 GAACTTGCACTTGTGAAGCCACGACAGAGCATGACCCTCAGCTCAACTTTCACCAATAAA 1680
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1621 GAACTTGCACTTGTGAAGCCACGACAGAGCATGACCCTCAGCTCAACTTTCACCAATAAA 1680

Qy 1681 GAGGCCTTCCTTGATGTTGATTCCAGTCCAGAAGGGTCTGGGACTGAAGATAATTTACAG 1740
        |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1681 GAGGCCTTCCTTGATGTTGATTCCAGTCCAGAAGGGTCTGGGACTGAAGATAATTTACAA 1740

Qy 1741 TGA 1743
        |||
Db 1741 TGA 1743



RESULT 7
E49870
LOCUS       E49870                  1743 bp    DNA     linear   PAT 27-AUG-2002
DEFINITION  High-affinity choline transporter.
ACCESSION   E49870
VERSION     E49870.1  GI:22554901
KEYWORDS    JP 2001136976-A/2.
SOURCE      Rattus sp.
  ORGANISM  Rattus sp.
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae;
            Rattus.
REFERENCE   1  (bases 1 to 1743)
  AUTHORS   Haga,T. and Okuda,T.
  TITLE     High-affinity choline transporter
  JOURNAL   Patent: JP 2001136976-A 2 22-MAY-2001;
            SCIENCE & TECH AGENCY
COMMENT     OS   Rattus sp. (rat)
            PN   JP 2001136976-A/2
            PD   22-MAY-2001
            PF   27-DEC-1999 JP 1999368991
            PI   TATSUYA HAGA,TAKASHI OKUDA
            PC   C12N15/09,A01K67/027,A61K38/00,C07K14/47,C07K16/18,C07K19/00,
            PC   C12N5/10,
            PC   C12P21/02,C12P21/08,C12Q1/00,C12N15/00,A61K37/02,C12N5/00
            CC
            FH   Key             Location/Qualifiers
            FT   CDS             (1)..(1743).
FEATURES             Location/Qualifiers
     source          1..1743
                     /organism="Rattus sp."
                     /mol_type="genomic DNA"
                     /db_xref="taxon:10118"
ORIGIN

Query Match 80.0%; Score 1394.2; DB 6; Length 1743;
Best Local Similarity 87.5%; Pred. No. 0;
Matches 1525; Conservative 0; Mismatches 218; Indels 0; Gaps 0;



Qy 1 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60
     ||| |||||||||| ||||||||  |||| || ||| ||||||||||||| || || |||
Db 1 ATGCCTTTCCATGTAGAAGGACTAGTAGCGATTATCCTGTTCTACCTTCTTATATTTCTG 60

Qy 61 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 120
      ||||||||||||||||| |||| |||||||||||| || |  |||||||| |||||||||
Db 61 GTTGGAATATGGGCTGCATGGAAAACCAAAAACAGCGGTAATGCAGAAGAACGCAGCGAA 120

Qy 121 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 180
       |||||||||||||| |||||||| |||||||| ||||||||||| ||||||||||||||
Db 121 GCCATCATAGTTGGGGGCCGAGACATTGGTTTGTTGGTTGGTGGTTTTACCATGACAGCC 180

Qy 181 ACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTAT 240
       |||||||| |||||||| || ||||| || |||||||||||||||||||  ||||||| |
Db 181 ACCTGGGTTGGAGGAGGTTACATCAACGGGACAGCTGAAGCAGTTTATGGGCCAGGTTGT 240

Qy 241 GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 300
       || |||||||||||||||||||| |||||||||||||| |||||||||||||||||||||
Db 241 GGTCTAGCTTGGGCTCAGGCACCCATTGGATATTCTCTGAGTCTGATTTTAGGTGGCCTG 300

Qy 301 TTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAA 360
       || |||||||||||||||||||| ||||| |||||||| ||||||||||||||||| ||
Db 301 TTTTTTGCAAAACCTATGCGTTCCAAGGGATATGTGACTATGTTAGACCCGTTTCAACAG 360

Qy 361 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTC 420
       ||||||||||| |||||||| || || ||||| || ||||||||||||||||| ||||||
Db 361 ATCTATGGAAAGCGCATGGGTGGGCTGCTGTTCATCCCTGCACTGATGGGAGAGATGTTC 420

Qy 421 TGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGAT 480
       ||||||||||||||||||||||| || || || ||||||||||| ||||| |||||||||
Db 421 TGGGCTGCAGCAATTTTCTCTGCATTAGGGGCTACCATCAGCGTAATCATTGATGTGGAT 480

Qy 481 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG 540
        || |||| || |||||  |||| ||||||||||||| ||| || || || |||||||||
Db 481 GTGAACATATCGGTCATTGTCTCCGCACTCATTGCCATTCTTTATACCCTCGTGGGAGGG 540

Qy 541 CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGG 600
       ||||| |||||||| || |||||||| || ||||| || ||||||||| ||||  |||||
Db 541 CTCTACTCTGTGGCATATACTGATGTTGTACAGCTATTCTGCATTTTTATAGGATTGTGG 600

Qy 601 ATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTG 660
       ||||| ||||| |||||  ||||||||||||||||| | ||||| || ||||||||||||
Db 601 ATCAGTGTCCCATTTGCCCTGTCACATCCTGCAGTCACCGACATTGGATTCACTGCTGTG 660

Qy 661 CATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGG 720
       ||||| |||||||| |  || |||||||||||  |||| |||  |||||||||| | |||
Db 661 CATGCTAAATACCAGAGTCCCTGGCTGGGAACCATTGAATCAGTTGAAGTCTACACCTGG 720

Qy 721 CTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGG 780
       ||||||| ||||||||||||||||||||||||||| ||||||||||| ||||| ||||||
Db 721 CTTGATAATTTTCTGTTGTTGATGCTGGGTGGAATACCATGGCAAGCCTACTTCCAGAGG 780

Qy 781 GTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGG 840
       || |||||||| || ||||| ||||||||||| ||||||||||||||||||||||| |||
Db 781 GTCCTCTCTTCATCGTCAGCGACCTATGCTCAGGTGCTGTCCTTCCTGGCAGCTTTTGGG 840

Qy 841 TGCCTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGG 900
       ||||||||||||||  | ||||||||   |||||||||||||||||| || |||||||||
Db 841 TGCCTGGTGATGGCTCTACCAGCCATTTGCATTGGGGCCATTGGAGCCTCCACAGACTGG 900

Qy 901 AACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTA 960
       ||||| |||||||||||| |||||||||||||||| |  || || |||||||||||| |
Db 901 AACCAAACTGCATATGGGTTTCCAGATCCCAAGACCAAGGAGGAAGCAGACATGATTCTC 960

Qy 961 CCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTT 1020
       || |||||||| ||||| |||||||||||||| ||||| |||||||| |||||||| |||
Db 961 CCGATTGTTCTACAGTACCTCTGCCCTGTGTACATTTCCTTCTTTGGGCTTGGTGCTGTT 1020

Qy 1021 TCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCA 1080
        ||||||||||| ||||| || || || || |||||| | ||||||||||||||||||||
Db 1021 TCTGCTGCTGTCATGTCCTCGGCTGACTCATCCATCCTATCAGCAAGTTCCATGTTTGCT 1080

Qy 1081 CGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTT 1140
        ||||| ||||||||||||||||||||||||||||| || ||||| ||||| || |||||
Db 1081 CGGAATATCTACCAGCTTTCCTTCAGACAAAATGCATCAGACAAGGAAATTGTGTGGGTC 1140

Qy 1141 ATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAA 1200
        ||| | ||||| ||||||||||||||||||||||||||||||||||||||||| |||||
Db 1141 ATGAGGATCACTGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTCACGAAG 1200

Qy 1201 ACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAG 1260
        ||||||||||||||||||||||| || ||||||||||| |||||| | |||||||| |||
Db 1201 ACTGTGTATGGGCTCTGGTACCTGAGCTCTGACCTTGTCTACATCATCATCTTCCCACAG 1260

Qy 1261 CTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTT 1320
        ||||| |||||||||||  | || ||||||||||| |||||||| || || |||||| ||
Db 1261 CTGCTCTGTGTACTCTTCATCAAAGGAACCAACACTTATGGGGCAGTTGCTGGTTATATT 1320

Qy 1321 TCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATC 1380
        | ||| || ||||||||||| || ||||| ||||||||||| ||  | |||||||| |||
Db 1321 TTTGGACTTTTCCTGAGAATTACCGGAGGAGAGCCATATCTATACTTGCAGCCCTTAATC 1380

Qy 1381 TTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAATTTCCATTTAAA 1440
        ||||||||||| |||||||||||  | ||||||||||| |||||||  || |||||||||
Db 1381 TTCTACCCTGGTTATTACCCTGACAAGAATGGTATATACAATCAGAGGTTCCCATTTAAA 1440

Qy 1441 ACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTAT 1500
        || ||  |||||||||| |||||||| |||||||||||  | ||||||||||||||||||
Db 1441 ACTCTCTCCATGGTTACCTCATTCTTTACCAACATTTGTGTTTCCTATCTAGCCAAGTAT 1500

Qy 1501 CTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGA 1560
        ||||||||||||||||||||||| || ||||||||| ||||||||||||||||  ||||
Db 1501 CTATTTGAAAGTGGAACCTTGCCTCCAAAATTAGATATATTTGATGCTGTTGTCTCAAGG 1560

Qy 1561 CACAGTGAAGAAAACATGGATAAGACAATTCTTGTCAAAAATGAAAATATTAAATTAGAT 1620
        ||||||||||| |||||||| ||||| ||||| |||| ||||||||| || |||||| ||
Db 1561 CACAGTGAAGAGAACATGGACAAGACCATTCTAGTCAGAAATGAAAACATCAAATTAAAT 1620



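Qy 1621 GAACTTGCACTTGTGAAGCCACGACAGAGCATGACCCTCAGCTCAACTTTCACCAATAAA 1680
        |||||||||| ||| ||||| ||||||||| | |||||||| ||||||||||||||||||
Db 1621 GAACTTGCACCTGTAAAGCCTCGACAGAGCCTAACCCTCAGTTCAACTTTCACCAATAAA 1680

Qy 1681 GAGGCCTTCCTTGATGTTGATTCCAGTCCAGAAGGGTCTGGGACTGAAGATAATTTACAG 1740
        |||||  ||||||||||||||||||||||||| || ||||||||||||||||| |||||
Db 1681 GAGGCTCTCCTTGATGTTGATTCCAGTCCAGAGGGATCTGGGACTGAAGATAACTTACAA 1740

Qy 1741 TGA 1743
        |||
Db 1741 TGA 1743
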
RESULT 8
BD012718
LOCUS       BD012718                1743 bp    DNA     linear   PAT 02-AUG-2002

DEFINITION  High-affinity choline transporter.
ACCESSION   BD012718
VERSION     BD012718.1  GI:22092907
KEYWORDS    WO 0116315-A/2.
SOURCE      Rattus norvegicus (Norway rat)
  ORGANISM  Rattus norvegicus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae;
            Rattus.
REFERENCE   1  (bases 1 to 1743)
  AUTHORS   Haga,T. and Okuda,T.
  TITLE     High-affinity choline transporter
  JOURNAL   Patent: WO 0116315-A 2 08-MAR-2001;
            JAPAN SCIENCE AND TECHNOLOGY CORP.TATSUYA HAGA,TAKASHI OKUDA
COMMENT     OS   Rattus norvegicus (rat)
            PN   WO 0116315-A/2
            PD   08-MAR-2001
            PF   18-AUG-2000 WO 2000JP005545
            PR   27-AUG-1999 JP 99P 240642, 27-DEC-1999 JP 99P 368991
            PI   TATSUYA HAGA,TAKASHI OKUDA
            PC   C12N15/12,C07K14/47,C12Q1/68,C07K19/00,C07K16/18,C12N5/10,
            PC   A61K38/17,
            PC   A61K45/00,A61P25/28,G01N33/53,A01K67/027
            CC
            FH   Key             Location/Qualifiers
            FT   CDS             (1)..(1743).
FEATURES             Location/Qualifiers
     source          1..1743
                     /organism="Rattus norvegicus"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:10116"
ORIGIN

Query Match 80.0%; Score 1394.2; DB 6; Length 1743;
Best Local Similarity 87.5%; Pred. No. 0;
Matches 1525; Conservative 0; Mismatches 218; Indels 0; Gaps 0;

Qy 1 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60
     ||| |||||||||| ||||||||  |||| || ||| ||||||||||||| || || |||
Db 1 ATGCCTTTCCATGTAGAAGGACTAGTAGCGATTATCCTGTTCTACCTTCTTATATTTCTG 60

Qy 61 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 120
      ||||||||||||||||| |||| |||||||||||| || |  |||||||| |||||||||
Db 61 GTTGGAATATGGGCTGCATGGAAAACCAAAAACAGCGGTAATGCAGAAGAACGCAGCGAA 120

Qy 121 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 180
       |||||||||||||| |||||||| |||||||| ||||||||||| ||||||||||||||
Db 121 GCCATCATAGTTGGGGGCCGAGACATTGGTTTGTTGGTTGGTGGTTTTACCATGACAGCC 180

Qy 181 ACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTAT 240
       |||||||| |||||||| || ||||| || |||||||||||||||||||  ||||||| |
Db 181 ACCTGGGTTGGAGGAGGTTACATCAACGGGACAGCTGAAGCAGTTTATGGGCCAGGTTGT 240

Qy 241 GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 300
       || |||||||||||||||||||| |||||||||||||| |||||||||||||||||||||
Db 241 GGTCTAGCTTGGGCTCAGGCACCCATTGGATATTCTCTGAGTCTGATTTTAGGTGGCCTG 300

Qy 301 TTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAA 360
       || |||||||||||||||||||| ||||| |||||||| ||||||||||||||||| ||
Db 301 TTTTTTGCAAAACCTATGCGTTCCAAGGGATATGTGACTATGTTAGACCCGTTTCAACAG 360

Qy 361 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTC 420
       ||||||||||| |||||||| || || ||||| || ||||||||||||||||| ||||||
Db 361 ATCTATGGAAAGCGCATGGGTGGGCTGCTGTTCATCCCTGCACTGATGGGAGAGATGTTC 420

Qy 421 TGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGAT 480
       ||||||||||||||||||||||| || || || ||||||||||| ||||| |||||||||
Db 421 TGGGCTGCAGCAATTTTCTCTGCATTAGGGGCTACCATCAGCGTAATCATTGATGTGGAT 480

Qy 481 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG 540
        || |||| || |||||  |||| ||||||||||||| ||| || || || |||||||||
Db 481 GTGAACATATCGGTCATTGTCTCCGCACTCATTGCCATTCTTTATACCCTCGTGGGAGGG 540

Qy 541 CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGG 600
       ||||| |||||||| || |||||||| || ||||| || ||||||||| ||||  |||||
Db 541 CTCTACTCTGTGGCATATACTGATGTTGTACAGCTATTCTGCATTTTTATAGGATTGTGG 600



Qy 601 ATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTG 660
       ||||| ||||| |||||  ||||||||||||||||| | ||||| || ||||||||||||
Db 601 ATCAGTGTCCCATTTGCCCTGTCACATCCTGCAGTCACCGACATTGGATTCACTGCTGTG 660

Qy 661 CATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGG 720
       ||||| |||||||| |  || |||||||||||  |||| |||  |||||||||| | |||
Db 661 CATGCTAAATACCAGAGTCCCTGGCTGGGAACCATTGAATCAGTTGAAGTCTACACCTGG 720

Qy 721 CTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGG 780
       ||||||| ||||||||||||||||||||||||||| ||||||||||| ||||| ||||||
Db 721 CTTGATAATTTTCTGTTGTTGATGCTGGGTGGAATACCATGGCAAGCCTACTTCCAGAGG 780

Qy 781 GTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGG 840
       || |||||||| || ||||| ||||||||||| ||||||||||||||||||||||| |||
Db 781 GTCCTCTCTTCATCGTCAGCGACCTATGCTCAGGTGCTGTCCTTCCTGGCAGCTTTTGGG 840

Qy 841 TGCCTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGG 900
       ||||||||||||||  | ||||||||   |||||||||||||||||| || |||||||||
Db 841 TGCCTGGTGATGGCTCTACCAGCCATTTGCATTGGGGCCATTGGAGCCTCCACAGACTGG 900

Qy 901 AACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTA 960
       ||||| |||||||||||| |||||||||||||||| |  || || |||||||||||| |
Db 901 AACCAAACTGCATATGGGTTTCCAGATCCCAAGACCAAGGAGGAAGCAGACATGATTCTC 960

Qy 961 CCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTT 1020
       || |||||||| ||||| |||||||||||||| ||||| |||||||| |||||||| |||
Db 961 CCGATTGTTCTACAGTACCTCTGCCCTGTGTACATTTCCTTCTTTGGGCTTGGTGCTGTT 1020

Qy 1021 TCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCA 1080
        ||||||||||| ||||| || || || || |||||| | ||||||||||||||||||||
Db 1021 TCTGCTGCTGTCATGTCCTCGGCTGACTCATCCATCCTATCAGCAAGTTCCATGTTTGCT 1080

Qy 1081 CGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTT 1140
        ||||| ||||||||||||||||||||||||||||| || ||||| ||||| || |||||
Db 1081 CGGAATATCTACCAGCTTTCCTTCAGACAAAATGCATCAGACAAGGAAATTGTGTGGGTC 1140

Qy 1141 ATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAA 1200
        ||| | ||||| ||||||||||||||||||||||||||||||||||||||||| |||||
Db 1141 ATGAGGATCACTGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTCACGAAG 1200

Qy 1201 ACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAG 1260
        ||||||||||||||||||||||| || ||||||||||| |||||| | |||||||| |||
Db 1201 ACTGTGTATGGGCTCTGGTACCTGAGCTCTGACCTTGTCTACATCATCATCTTCCCACAG 1260

Qy 1261 CTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTT 1320
        ||||| |||||||||||  | || ||||||||||| |||||||| || || |||||| ||
Db 1261 CTGCTCTGTGTACTCTTCATCAAAGGAACCAACACTTATGGGGCAGTTGCTGGTTATATT 1320

Qy 1321 TCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATC 1380
        | ||| || ||||||||||| || ||||| ||||||||||| ||  | |||||||| |||
Db 1321 TTTGGACTTTTCCTGAGAATTACCGGAGGAGAGCCATATCTATACTTGCAGCCCTTAATC 1380

Qy 1381 TTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAATTTCCATTTAAA 1440
        ||||||||||| |||||||||||  | ||||||||||| |||||||  || |||||||||
Db 1381 TTCTACCCTGGTTATTACCCTGACAAGAATGGTATATACAATCAGAGGTTCCCATTTAAA 1440

Qy 1441 ACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTAT 1500
        || ||  |||||||||| |||||||| |||||||||||  | ||||||||||||||||||
Db 1441 ACTCTCTCCATGGTTACCTCATTCTTTACCAACATTTGTGTTTCCTATCTAGCCAAGTAT 1500

Qy 1501 CTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGA 1560
        ||||||||||||||||||||||| || ||||||||| ||||||||||||||||  ||||
Db 1501 CTATTTGAAAGTGGAACCTTGCCTCCAAAATTAGATATATTTGATGCTGTTGTCTCAAGG 1560

Qy 1561 CACAGTGAAGAAAACATGGATAAGACAATTCTTGTCAAAAATGAAAATATTAAATTAGAT 1620
        ||||||||||| |||||||| ||||| ||||| |||| ||||||||| || |||||| ||
Db 1561 CACAGTGAAGAGAACATGGACAAGACCATTCTAGTCAGAAATGAAAACATCAAATTAAAT 1620

Qy 1621 GAACTTGCACTTGTGAAGCCACGACAGAGCATGACCCTCAGCTCAACTTTCACCAATAAA 1680
        |||||||||| ||| ||||| ||||||||| | |||||||| ||||||||||||||||||
Db 1621 GAACTTGCACCTGTAAAGCCTCGACAGAGCCTAACCCTCAGTTCAACTTTCACCAATAAA 1680

Qy 1681 GAGGCCTTCCTTGATGTTGATTCCAGTCCAGAAGGGTCTGGGACTGAAGATAATTTACAG 1740
        |||||  ||||||||||||||||||||||||| || ||||||||||||||||| |||||
Db 1681 GAGGCTCTCCTTGATGTTGATTCCAGTCCAGAGGGATCTGGGACTGAAGATAACTTACAA 1740

Qy 1741 TGA 1743
        |||
Db 1741 TGA 1743
RESULT 9
AB030947
LOCUS       AB030947                4904 bp    mRNA    linear   ROD 03-FEB-2000

DEFINITION  Rattus norvegicus mRNA for high-affinity choline transporter CHT1,
            complete cds.
ACCESSION   AB030947
VERSION     AB030947.1  GI:6863033
KEYWORDS    choline transporter; high-affinity choline transporter CHT1.
SOURCE      Rattus norvegicus (Norway rat)
  ORGANISM  Rattus norvegicus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae;
            Rattus.
REFERENCE   1  (sites)
  AUTHORS   Okuda,T., Haga,T., Kanai,Y., Endou,H., Ishihara,T. and Katsura,I.
  TITLE     Identification and characterization of the high-affinity choline
            transporter
  JOURNAL   Nat. Neurosci. 3 (2), 120-125 (2000)
  MEDLINE   20116099
   PUBMED   10649566
REFERENCE   2  (bases 1 to 4904)
  AUTHORS   Okuda,T.
  TITLE     Direct Submission
  JOURNAL   Submitted (09-AUG-1999) Takashi Okuda, University of Tokyo, Faculty
            of Medicine, Department of Neurochemistry; Hongo 7-3-1, Bunkyo-ku
            113-0033, Japan (E-mail:okuda@m.u-tokyo.ac.jp, Tel:+81-3-5841-3560,
            Fax:+81-3-3814-8154)
COMMENT     Sequence updated (11-Jan-2000).
FEATURES             Location/Qualifiers
     source          1..4904
                     /organism="Rattus norvegicus"
                     /mol_type="mRNA"
                     /strain="Wistar"
                     /db_xref="taxon:10116"
                     /clone="CHT1"
                     /tissue_type="spinal cord"
                     /clone_lib="rat spinal cord cDNA library"
                     /dev_stage="adult"
     CDS             224..1966
                     /codon_start=1
                     /product="high-affinity choline transporter CHT1"
                     /protein_id="BAA90484.1"
                     /db_xref="GI:6863034"
                     /translation="MPFHVEGLVAIILFYLLIFLVGIWAAWKTKNSGNAEERSEAIIV
                     GGRDIGLLVGGFTMTATWVGGGYINGTAEAVYGPGCGLAWAQAPIGYSLSLILGGLFF
                     AKPMRSKGYVTMLDPFQQIYGKRMGGLLFIPALMGEMFWAAAIFSALGATISVIIDVD
                     VNISVIVSALIAILYTLVGGLYSVAYTDVVQLFCIFIGLWISVPFALSHPAVTDIGFT
                     AVHAKYQSPWLGTIESVEVYTWLDNFLLLMLGGIPWQAYFQRVLSSSSATYAQVLSFL
                     AAFGCLVMALPAICIGAIGASTDWNQTAYGFPDPKTKEEADMILPIVLQYLCPVYISF
                     FGLGAVSAAVMSSADSSILSASSMFARNIYQLSFRQNASDKEIVWVMRITVFVFGASA
                     TAMALLTKTVYGLWYLSSDLVYIIIFPQLLCVLFIKGTNTYGAVAGYIFGLFLRITGG
                     EPYLYLQPLIFYPGYYPDKNGIYNQRFPFKTLSMVTSFFTNICVSYLAKYLFESGTLP
                     PKLDIFDAVVSRHSEENMDKTILVRNENIKLNELAPVKPRQSLTLSSTFTNKEALLDV
                     DSSPEGSGTEDNLQ"
ORIGIN

Query Match 80.0%; Score 1394.2; DB 10; Length 4904;
Best Local Similarity 87.5%; Pred. No. 0;
Matches 1525; Conservative 0; Mismatches 218; Indels 0; Gaps 0;

Qy   1 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60
       ||| |||||||||| ||||||||  |||| || ||| ||||||||||||| || || |||
Db 224 ATGCCTTTCCATGTAGAAGGACTAGTAGCGATTATCCTGTTCTACCTTCTTATATTTCTG 283

Qy  61 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 120
       ||||||||||||||||| |||| |||||||||||| || |  |||||||| |||||||||
Db 284 GTTGGAATATGGGCTGCATGGAAAACCAAAAACAGCGGTAATGCAGAAGAACGCAGCGAA 343

Qy 121 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 180
       |||||||||||||| |||||||| |||||||| ||||||||||| ||||||||||||||
Db 344 GCCATCATAGTTGGGGGCCGAGACATTGGTTTGTTGGTTGGTGGTTTTACCATGACAGCC 403

Qy 181 ACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTAT 240
       |||||||| |||||||| || ||||| || |||||||||||||||||||  ||||||| |
Db 404 ACCTGGGTTGGAGGAGGTTACATCAACGGGACAGCTGAAGCAGTTTATGGGCCAGGTTGT 463

Qy 241 GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 300
       || |||||||||||||||||||| |||||||||||||| |||||||||||||||||||||
Db 464 GGTCTAGCTTGGGCTCAGGCACCCATTGGATATTCTCTGAGTCTGATTTTAGGTGGCCTG 523

Qy 301 TTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAA 360
       || |||||||||||||||||||| ||||| |||||||| ||||||||||||||||| ||
Db 524 TTTTTTGCAAAACCTATGCGTTCCAAGGGATATGTGACTATGTTAGACCCGTTTCAACAG 583

Qy 361 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTC 420
       ||||||||||| |||||||| || || ||||| || ||||||||||||||||| ||||||
Db 584 ATCTATGGAAAGCGCATGGGTGGGCTGCTGTTCATCCCTGCACTGATGGGAGAGATGTTC 643

Qy 421 TGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGAT 480
       ||||||||||||||||||||||| || || || ||||||||||| ||||| |||||||||
Db 644 TGGGCTGCAGCAATTTTCTCTGCATTAGGGGCTACCATCAGCGTAATCATTGATGTGGAT 703

Qy 481 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG 540
        || |||| || |||||  |||| ||||||||||||| ||| || || || |||||||||
Db 704 GTGAACATATCGGTCATTGTCTCCGCACTCATTGCCATTCTTTATACCCTCGTGGGAGGG 763

Qy 541 CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGG 600
       ||||| |||||||| || |||||||| || ||||| || ||||||||| ||||  |||||
Db 764 CTCTACTCTGTGGCATATACTGATGTTGTACAGCTATTCTGCATTTTTATAGGATTGTGG 823

Qy 601 ATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTG 660
       ||||| ||||| |||||  ||||||||||||||||| | ||||| || ||||||||||||
Db 824 ATCAGTGTCCCATTTGCCCTGTCACATCCTGCAGTCACCGACATTGGATTCACTGCTGTG 883

Qy 661 CATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGG 720
       ||||| |||||||| |  || |||||||||||  |||| |||  |||||||||| | |||
Db 884 CATGCTAAATACCAGAGTCCCTGGCTGGGAACCATTGAATCAGTTGAAGTCTACACCTGG 943

Qy 721 CTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGG 780
       ||||||| ||||||||||||||||||||||||||| ||||||||||| ||||| ||||||
Db 944 CTTGATAATTTTCTGTTGTTGATGCTGGGTGGAATACCATGGCAAGCCTACTTCCAGAGG 1003

Qy  781 GTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGG 840
        || |||||||| || ||||| ||||||||||| ||||||||||||||||||||||| |||
Db 1004 GTCCTCTCTTCATCGTCAGCGACCTATGCTCAGGTGCTGTCCTTCCTGGCAGCTTTTGGG 1063

Qy  841 TGCCTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGG 900
        ||||||||||||||  | ||||||||   |||||||||||||||||| || |||||||||
Db 1064 TGCCTGGTGATGGCTCTACCAGCCATTTGCATTGGGGCCATTGGAGCCTCCACAGACTGG 1123

Qy  901 AACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTA 960
        ||||| |||||||||||| |||||||||||||||| |  || || |||||||||||| |
Db 1124 AACCAAACTGCATATGGGTTTCCAGATCCCAAGACCAAGGAGGAAGCAGACATGATTCTC 1183

Qy  961 CCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTT 1020
        || |||||||| ||||| |||||||||||||| ||||| |||||||| |||||||| |||
Db 1184 CCGATTGTTCTACAGTACCTCTGCCCTGTGTACATTTCCTTCTTTGGGCTTGGTGCTGTT 1243

Qy 1021 TCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCA 1080
        ||||||||||| ||||| || || || || |||||| | ||||||||||||||||||||
Db 1244 TCTGCTGCTGTCATGTCCTCGGCTGACTCATCCATCCTATCAGCAAGTTCCATGTTTGCT 1303

Qy 1081 CGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTT 1140
        ||||| ||||||||||||||||||||||||||||| || ||||| ||||| || |||||
Db 1304 CGGAATATCTACCAGCTTTCCTTCAGACAAAATGCATCAGACAAGGAAATTGTGTGGGTC 1363

Qy 1141 ATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAA 1200
        ||| | ||||| ||||||||||||||||||||||||||||||||||||||||| |||||
Db 1364 ATGAGGATCACTGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTCACGAAG 1423

Qy 1201 ACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAG 1260
        ||||||||||||||||||||||| || ||||||||||| |||||| | |||||||| |||
Db 1424 ACTGTGTATGGGCTCTGGTACCTGAGCTCTGACCTTGTCTACATCATCATCTTCCCACAG 1483

Qy 1261 CTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTT 1320
        ||||| |||||||||||  | || ||||||||||| |||||||| || || |||||| ||
Db 1484 CTGCTCTGTGTACTCTTCATCAAAGGAACCAACACTTATGGGGCAGTTGCTGGTTATATT 1543

Qy 1321 TCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATC 1380
        | ||| || ||||||||||| || ||||| ||||||||||| ||  | |||||||| |||
Db 1544 TTTGGACTTTTCCTGAGAATTACCGGAGGAGAGCCATATCTATACTTGCAGCCCTTAATC 1603

Qy 1381 TTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAATTTCCATTTAAA 1440
        ||||||||||| |||||||||||  | ||||||||||| |||||||  || |||||||||
Db 1604 TTCTACCCTGGTTATTACCCTGACAAGAATGGTATATACAATCAGAGGTTCCCATTTAAA 1663

Qy 1441 ACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTAT 1500
        || ||  |||||||||| |||||||| |||||||||||  | ||||||||||||||||||
Db 1664 ACTCTCTCCATGGTTACCTCATTCTTTACCAACATTTGTGTTTCCTATCTAGCCAAGTAT 1723

Qy 1501 CTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGA 1560
        ||||||||||||||||||||||| || ||||||||| ||||||||||||||||  ||||
Db 1724 CTATTTGAAAGTGGAACCTTGCCTCCAAAATTAGATATATTTGATGCTGTTGTCTCAAGG 1783

Qy 1561 CACAGTGAAGAAAACATGGATAAGACAATTCTTGTCAAAAATGAAAATATTAAATTAGAT 1620
        ||||||||||| |||||||| ||||| ||||| |||| ||||||||| || |||||| ||
Db 1784 CACAGTGAAGAGAACATGGACAAGACCATTCTAGTCAGAAATGAAAACATCAAATTAAAT 1843

Qy 1621 GAACTTGCACTTGTGAAGCCACGACAGAGCATGACCCTCAGCTCAACTTTCACCAATAAA 1680
        |||||||||| ||| ||||| ||||||||| | |||||||| ||||||||||||||||||
Db 1844 GAACTTGCACCTGTAAAGCCTCGACAGAGCCTAACCCTCAGTTCAACTTTCACCAATAAA 1903

Qy 1681 GAGGCCTTCCTTGATGTTGATTCCAGTCCAGAAGGGTCTGGGACTGAAGATAATTTACAG 1740
        |||||  ||||||||||||||||||||||||| || ||||||||||||||||| |||||
Db 1904 GAGGCTCTCCTTGATGTTGATTCCAGTCCAGAGGGATCTGGGACTGAAGATAACTTACAA 1963

Qy 1741 TGA 1743
        |||
Db 1964 TGA 1966



RESULT 10
AF276872
LOCUS       AF276872                1743 bp    mRNA    linear   ROD 28-FEB-2001
DEFINITION  Mus musculus sodium and chloride-dependent high-affinity choline
            transporter mRNA, complete cds.
ACCESSION   AF276872
VERSION     AF276872.2  GI:13162669
KEYWORDS    .
SOURCE      Mus musculus (house mouse)
  ORGANISM  Mus musculus
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
REFERENCE   1  (bases 1 to 1743)
  AUTHORS   Apparsundaram,S., Ferguson,S.M. and Blakely,R.D.
  TITLE     Molecular cloning and characterization of human and murine
            high-affinity choline transporters
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 1743)
  AUTHORS   Apparsundaram,S., Ferguson,S.M. and Blakely,R.D.
  TITLE     Direct Submission
  JOURNAL   Submitted (09-JUN-2000) Department of Pharmacology and Center for
            Molecular Neuroscience, Vanderbilt University, 23rd Avenue South at
            Pierce, Nashville, TN 37232-6420, USA
REFERENCE   3  (bases 1 to 1743)
  AUTHORS   Apparsundaram,S., Ferguson,S.M. and Blakely,R.D.
  TITLE     Direct Submission
  JOURNAL   Submitted (28-FEB-2001) Department of Pharmacology and Center for
            Molecular Neuroscience, Vanderbilt University, 23rd Avenue South at
            Pierce, Nashville, TN 37232-6420, USA
  REMARK    Sequence update by submitter
COMMENT     On Feb 28, 2001 this sequence version replaced gi:11527247.
FEATURES             Location/Qualifiers
     source          1..1743
                     /organism="Mus musculus"
                     /mol_type="mRNA"
                     /db_xref="taxon:10090"
     CDS             1..1743
                     /codon_start=1
                     /product="sodium and chloride-dependent high-affinity
                     choline transporter"
                     /protein_id="AAG36945.2"
                     /db_xref="GI:13162670"
                     /translation="MPFHVEGLVAIILFYLLIFLVGIWAAWKTKNSGNPEERSEAIIV
                     GGRDIGLLVGGFTMTATWVGGGYINGTAEAVYGPGCGLAWAHAPIGYSLSLILGGLFF
                     AKPMRSKGYVTMLDPFKQIYGKRMGGLLFIPALMGEMFWAAAIFSALGATISVIIDVD
                     VNISVIVSALIAILYTLVGGLYSVAYTDVVQLFCIFIGLWISVPFALSHPAVTDIGFT
                     AVHAKYQSPWLGTIESVEVYTWLDNFLLLMLGGIPWQAYFQRVLSSSSATYAQVLSFL
                     AAFGCLVMALPAICIGAIGASTDWNQTAYGYPDPKTKEEADMILPIVLQYLCPVYISF
                     FGLGAVSAAVMSSADSSILSASSMFARNIYQLSFRQNASDKEIVWVMRITVLVFGASA
                     TAMALLTKTVYGLWYLSSDLVYIIIFPQLLCVLFIKGTNTYGAVAGYIFGLFLRITGG
                     EPYLYLQPLIFYPGYYSDKNGIYNQRFPFKTLSMVTSFFTNICVSYLAKYLFESGTLP
                     PKLDVFDAVVARHSEENMDKTILVRNENIKLNELAPVKPRQSLTLSSTFTNKEALLDV
                     DSSPEGSGTEDNLQ"
ORIGIN

Query Match 78.9%; Score 1375; DB 10; Length 1743;
Best Local Similarity 86.8%; Pred. No. 0;
Matches 1513; Conservative 0; Mismatches 230; Indels 0; Gaps 0;



Qy 

Db 



1 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60 

Ml I I I I I I I I I I I I I I I I I I M I I I I I I I III I I II I I I I I II II M III 
1 ATGCCTTTCCATGTGGAAGGACTGGTAGCTATTATCCTCTTCTACCTCCTTATATTTCTG 60 



Qy 

Db 



61 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 120 

I I I I I I M I I II I I I I I I I I I I I I I I I I I I I I I II I I I M M I I I II I I I I III 

61 GTTGGAATATGGGCTGCATGGAAAACCAAAAACAGCGGCAACCCAGAAGAGCGCAGTGAA 120



Qy 

Db 

Qy 

Db 

Qy 

Db 



121 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 180 

II I II I M I I I II I I II I M I II I I I I I I II I I I II I I I I M I II I I I I M I I 
121 GCCATCATAGTCGGGGGCCGTGACATTGGTTTGTTGGTTGGTGGTTTTACCATGACAGCC 180 

181 ACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTAT 240 

I II M II I I I I I I I I I II II I I I I II I I M I II I I I M I I I II I I I M I I I 

181 ACCTGGGTTGGAGGAGGCTACATCAATGGGACAGCAGAAGCAGTGTATGGGCCAGGTTGT 240 

241 GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 300 

II I I M I I I I I I I I I I I I I I I I M I I II I I II I I I I II I I I I I I I I I I I I I III 

241 GGTCTAGCTTGGGCTCATGCACCCATTGGATATTCTCTGAGTCTAATTTTAGGTGGTCTG 300 



301 TTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAA 360

II I I I I I I I I I I I I I I I I M I Mill I I M M I I I I I I I M I I I I II I II 

301 TTTTTTGCGAAACCTATGCGTTCCAAGGGATATGTGACTATGTTAGACCCATTCAAACAG 360 

361 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTC 420

I II II M II I I I II II I I I II M II II II I I I I I I II M I I I M I I II I II I 
361 ATCTATGGAAAGCGCATGGGTGGGCTGCTCTTCATCCCTGCACTGATGGGAGAGATGTTC 420 

421 TGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGAT 480 

I I I II I I M I I I I I I I I I I M II II II I I I I I II I I M I I M I I I I I I I I I I I I M 

421 TGGGCTGCAGCAATTTTCTCTGCATTAGGGGCCACCATCAGCGTGATCATTGATGTGGAT 480 

481 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG 540 

II I I I I II I I I I I II I I I I I I I I M I I I I II III II II M I I II I III 
481 GTGAACATATCGGTCATTGTCTCTGCACTCATTGCCATTCTTTATACCCTAGTGGGTGGG 540 

541 CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGG 600 

Mill I I II II I I II I II II II I II I I I II II I II II II M I I II I II I II 
541 CTCTACTCTGTGGCATATACTGATGTTGTCCAGCTATTCTGCATTTTTATAGGACTGTGG 600 

601 ATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTG 660 

I II II I I M I I I I II I I II I I I II I II I I I I I I I II I I I II I I I I I I I I I II 
601 ATCAGTGTCCCTTTTGCCCTGTCACATCCTGCAGTCACCGACATCGGATTCACAGCTGTG 660 

661 CATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGG 720 

I I I I I I II I II M I II MUM MM Ml M M II I I M I III 

661 CATGCTAAATACCAGAGTCCCTGGCTGGGAACCATTGAATCAGTTGAAGTCTACACCTGG 720 
721 CTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGG 780 

II M M I I I I I I II I I I M II I II M II I II II I M I I I I I II I I Mill I I I I I I 

721 CTTGATAATTTTCTGTTATTGATGCTGGGTGGAATCCCATGGCAAGCCTACTTCCAGAGG 780 
781 GTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGG 840 

II II II II II II M M I II II II II I M M M II I II II II II II M I II M III 

781 GTCCTCTCTTCATCCTCAGCCACCTATGCTCAGGTACTGTCCTTCCTGGCAGCTTTTGGG 840 
841 TGCCTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGG 900 

II II II I II M M I I II II II II M I II I I I II II I II II M I I I M II 

841 TGCCTGGTGATGGCTCTACCCGCCATATGCATAGGAGCTATTGGAGCTTCCACAGACTGG 900 

901 AACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTA 960 

II M II II I II II III II M II II II I M I M I M II I I II II II II II I 

901 AACCAGACTGCCTACGGGTATCCAGATCCCAAGACTAAGGAGGAAGCAGACATGATTCTC 960 
961 CCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTT 1020 

II II II I II M I I M M M II I II I I I M II II M I M I 11' II I I M M III 

961 CCGATCGTTCTGCAGTACCTCTGCCCTGTGTACATCTCCTTCTTTGGGCTTGGTGCTGTT 1020 

1021 TCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCA 1080 

II II M II II MM! Mill II II I I I I I I I II I II II I I I II II I I II 
1021 TCAGCTGCTGTCATGTCCTCAGCTGACTCGTCCATCCTGTCGGCGAGTTCTATGTTTGCT 1080

1081 CGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTT 1140 

Mill I II II II II I II II II II I M I I M II II M Mill Mill II II II I 
1081 CGGAATATCTACCAGCTTTCCTTCAGACAAAATGCATCAGACAAGGAAATTGTGTGGGTC 1140 

Qy 1141 ATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAA 1200
III 1 1 1 1 1 1 III 11 1 M II II II M 1 II II 1 1 M II II 1 II 1 1 1 1 1 1 1 1 1 1 1 1
Db 1141 ATGAGGATCACTGTGCTTGTGTTCGGAGCATCTGCAACAGCCATGGCTTTGCTGACGAAG 1200

Qy 1201 ACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAG 1260
1 II II 1 1 1 1 M 1 II II 1 II II 1 1 II 1 1 1 1 M 1 1 II 1 1 1 1 1 M 1 II 1 1 1 II 1 Ml
Db 1201 ACTGTGTATGGGCTCTGGTACCTGAGCTCTGACCTTGTCTACATCATCATCTTCCCACAG 1260

Qy 1261 CTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTT 1320
1 1 1 II 1 1 1 M 1 M 1 1 1 1 II 1 1 M 1 1 1 1 1 1 1 1 1 M 1 1 1 1 II II MINI II
Db 1261 CTGCTCTGTGTACTCTTCATCAAAGGAACCAACACTTATGGGGCAGTTGCTGGTTATATT 1320

Qy 1321 TCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATC 1380
1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 II 1 M 1 1 1 1 1 1 III
Db 1321 TTTGGACTATTCCTGAGAATTACTGGAGGAGAGCCATATCTATACTTGCAGCCCTTAATC 1380

Qy 1381 TTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAATTTCCATTTAAA 1440
1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1
Db 1381 TTCTACCCTGGTTATTACTCTGACAAGAATGGTATATACAATCAGAGGTTCCCATTTAAA 1440

Qy 1441 ACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTAT 1500
II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
Db 1441 ACTCTCTCCATGGTTACCTCATTCTTTACCAACATTTGTGTTTCTTATCTAGCCAAGTAT 1500

Qy 1501 CTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGA 1560
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 M 1 1 1 II 1 1 1 1 1 1 1
Db 1501 CTATTTGAAAGTGGAACCTTGCCTCCAAAATTAGATGTATTTGATGCTGTTGTCGCAAGG 1560

Qy 1561 CACAGTGAAGAAAACATGGATAAGACAATTCTTGTCAAAAATGAAAATATTAAATTAGAT 1620
1 II 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II
Db 1561 CACAGTGAAGAGAACATGGACAAGACCATTCTAGTCAGAAATGAAAATATCAAATTAAAT 1620

Qy 1621 GAACTTGCACTTGTGAAGCCACGACAGAGCATGACCCTCAGCTCAACTTTCACCAATAAA 1680
1 II 1 1 1 1 1 M 1 1 1 1 1 1 II M 1 1 1 1 M 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
Db 1621 GAACTTGCACCTGTGAAACCTCGGCAGAGCCTAACCCTCAGTTCAACTTTCACCAATAAG 1680

Qy 1681 GAGGCCTTCCTTGATGTTGATTCCAGTCCAGAAGGGTCTGGGACTGAAGATAATTTACAG 1740
1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 I i i 1 1 1 1 [ 1 II M 1 1 1 1 1 1 1 1 M 1 1 1 1 i 1 1 1 1 1 1 1 1 1
Db 1681 GAGGCCCTCCTTGATGTTGATTCCAGTCCGGAGGGGTCTGGGACTGAAGATAATTTACAA 1740

Qy 1741 TGA 1743
1 1 1
Db 1741 TGA 1743





RESULT 11 

E49872 

LOCUS E49872 1743 bp DNA linear PAT 27-AUG-2002
DEFINITION High-affinity choline transporter
ACCESSION E49872
VERSION E49872.1 GI:22554903
KEYWORDS JP 2001136976-A/4.
SOURCE Mus sp.
ORGANISM Mus sp.
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
REFERENCE 1 (bases 1 to 1743)
AUTHORS Haga,T. and Okuda,T.
TITLE High-affinity choline transporter
JOURNAL Patent: JP 2001136976-A 4 22-MAY-2001;
SCIENCE & TECH AGENCY
COMMENT OS Mus sp. (mouse)
PN JP 2001136976-A/4
PD 22-MAY-2001
PF 27-DEC-1999 JP 1999368991
PI TATSUYA HAGA.TAKASHI OKUDA
PC C12N15/09,A01K67/027,A61K38/00,C07K14/47,C07K16/18,C07K19/00,
PC C12N5/10,
PC C12P21/02,C12P21/08,C12Q1/00,C12N15/00,A61K37/02,C12N5/00
CC
FH Key Location/Qualifiers
FT CDS (1)..(1743).
FEATURES Location/Qualifiers
source 1..1743
/organism="Mus sp."
/mol_type="genomic DNA"
/db_xref="taxon:10095"
ORIGIN

Query Match 78.8%; Score 1373.4; DB 6; Length 1743;

Best Local Similarity 86.7%; Pred. No. 0;

Matches 1512; Conservative 0; Mismatches 231; Indels 0; Gaps 0;



Qy 1 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60
III I I I II II II II I II I I I I I I I I I I I III I I I II I I I I II II II III
Db 1 ATGTCTTTCCACGTAGAAGGACTGGTAGCTATTATCCTCTTCTACCTCCTTATATTTCTG 60

Qy 61 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 120
I I I I I I I I I I I M I I I I I I II I I I I I I I I I I I I I I I I I I I I I II I I I III III
Db 61 GTTGGAATATGGGCTGCATGGAAAACCAAAAACAGCGGCAACCCAGAAGAGCACAGTGAA 120

Qy 121 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 180
I II II II II M II I II I I II I I II I I I I I II II II II II I I I I I I II I I II I I
Db 121 GCCATCATAGTCGGGGGCCGTGACATTGGTTTGTTGGTTGGTGGTTTTACCATGACAGCC 180

Qy 181 ACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTAT 240
I I I I I I II I I I I I I I I II II I II II I II I I I I I I I I I I I I I I I I I I I I I I I
Db 181 ACCTGGGTTGGAGGAGGCTACATCAATGGGACAGCAGAAGCAGTGTATGGGCCAGGTTGT 240

Qy 241 GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 300
II I I I I II M I I I I I I I M I I I I II I II I I II I I II I I I I I I I I I II I I I I I III
Db 241 GGTCTAGCTTGGGCTCAGGCACCCATTGGATATTCTCTGAGTCTAATTTTAGGTGGTCTG 300

Qy 301 TTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAA 360
II I I I I I I I I I I i i I I I I I II I II I I I I I I I I I I I I I M I I I I I I I I I I I II
Db 301 TTTTTTGCGAAACCTATGCGTTCCAAGGGATATGTGACTATGTTAGACCCATTTCAACAG 360

Qy 361 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTC 420
M I I I I I II I I I II I M II I I M I I I I I I I I I I I II I II I I I
Db 361 ATCTATGGAAAGCGCATGGGTGGGCTGCTCTTCATCCCTGCACTGATGGGAGAGATGTTC 420

Qy 421 TGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGAT 480
I M I I I I I 11 I I I M I I I I II I I II II I I I I I M II I I I I M I I II I I I I I I I I I I
Db 421 TGGGCTGCAGCAATTTTCTCTGCATTAGGGGCCACCATCAGCGTGATCATTGATGTGGAT 480

Qy 481 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG 540
Db 481 GTGAACATATCGGTCATTGTCTCTGCACTCATTGCCATTCTTTATACCCTAGTGGGTGGG 540

Qy 541 CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGG 600
1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 M Mill II 1 1 II 1 1 1 II II 1 1 MINI
Db 541 CTCTACTCTGTGGCATATACTGATGTTGTCCAGCTATTCTGCATTTTTATAGGACTGTGG 600

Qy 601 ATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTG 660
II 1 II II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 II 1
Db 601 ATCAGTGTCCCTTTTGCCCTGTCACATCCTGCAGTCACCGACATCGGATTCACAGCTGTG 660

Qy 661 CATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGG 720
II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
Db 661 CATGCTAAATACCAGAGTCCCTGGCTGGGAACCATTGAATCAGTTGAAGTCTACACCTGG 720

Qy 721 CTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGG 780
1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
Db 721 CTTGATAATTTTCTGTTATTGATGCTGGGTGGAATCCCATGGCAAGCCTACTTCCAGAGG 780

Qy 781 GTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGG 840
II 1 1 1 1 1 1 1 1 1 II M 1 1 1 1 1 1 M 1 1 1 1 1 1 1 II 1 1 1 1 II 1 M 1 1 1 1 1 1 1 1 1 M III
Db 781 GTCCTCTCTTCATCCTCAGCCACCTATGCTCAGGTACTGTCCTTCCTGGCAGCTTTTGGG 840

Qy 841 TGCCTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGG 900
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 M 1 1 III II M 1 1 1 1 1 1 1 1 II M 1 1 1 1 1 1 1
Db 841 TGCCTGGTGATGGCTCTACCCGCCATATGCATAGGAGCTATTGGAGCTTCCACAGACTGG 900

Qy 901 AACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTA 960
1 1 1 1 1 1 1 1 1 1 1 II III 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 II II M 1 1 1 1 1 1 1 1 1 1 1
Db 901 AACCAGACTGCCTACGGGTATCCAGATCCCAAGACTAAGGAGGAAGCAGACATGATTCTC 960

Qy 961 CCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTT 1020
II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 II II 1 1 1 1 M 1 1 1 1 1 1 M 1 1 III
Db 961 CCGATCGTTCTGCAGTACCTCTGCCCTGTGTACATCTCCTTCTTTGGGCTTGGTGCTGTT 1020

Qy 1021 TCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCA 1080
II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 Ml! II 1 1 II 1 1 II II 1 1 1
Db 1021 TCAGCTGCTGTCATGTCCTCAGCTGACTCGTCCATCCTGTCGGCGAGTTCTATGTTTGCT 1080

Qy 1081 CGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTT 1140
1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 M 1 M 1 1 M 1 1 1 1 1 1 1 1 II 1 1 1 1 1 Mill II II 1 II
Db 1081 CGGAATATCTACCAGCTTTCCTTCAGACAAAATGCATCAGACAAGGAAATTGTGTGGGTC 1140

Qy 1141 ATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAA 1200
III { 1 1 1 1 1 III 1 M II 1 1 II M 1 1 1 1 1 1 II 1 II II 1 M II 1 II II M II 1 II
Db 1141 ATGAGGATCACTGTGCTTGTGTTCGGAGCATCTGCAACAGCCATGGCTTTGCTGACGAAG 1200

Qy 1201 ACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAG 1260
M II M 11 II 1 II 1 II II II 1 II M II 1 II II II 1 1 II 1 II 1 1 II 11 M II III
Db 1201 ACTGTGTATGGGCTCTGGTACCTGAGCTCTGACCTTGTCTACATCATCATCTTCCCACAG 1260

Qy 1261 CTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTT 1320
II t II 1 II II II 1 1 M 1 II 1 1 II II 1 II II 1 II 1 II 1 1 II II II 1 II 1 II
Db 1261 CTGCTCTGTGTACTCTTCATCAAAGGAACCAACACTTATGGGGCAGTTGCTGGTTATATT 1320

Qy 1321 TCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATC 1380
I III II I I I I I I I I I I I 11 I I I I I I II II II I I I M M I I I I I I I I I III
Db 1321 TTTGGACTATTCCTGAGAATTACTGGAGGAGAGCCATATCTATACTTGCAGCCCTTAATC 1380

Qy 1381 TTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAATTTCCATTTAAA 1440
I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I M I I I M I I I I I I I I I
Db 1381 TTCTACCCTGGTTATTACTCTGACAAGAATGGTATATACAATCAGAGGTTCCCATTTAAA 1440

Qy 1441 ACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTAT 1500
II II I II I I II I II III II M I II II I M II II I II I I I I I II I I I I I I I I
Db 1441 ACTCTCTCCATGGTTACCTCATTCTTTACCAACATTTGTGTTTCTTATCTAGCCAAGTAT 1500

Qy 1501 CTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGA 1560
I II I I I II I I I I II I I I I I I I I I M I I II I I I I I I I I II I I I I I M I I I I I I I I I I
Db 1501 CTATTTGAAAGTGGAACCTTGCCTCCAAAATTAGATGTATTTGATGCTGTTGTCGCAAGG 1560

Qy 1561
II M I I I I I II I I I I I II I I I I I I Mill II II I I I I I I I I I I I I I M I M II
Db 1561

Qy 1621 GAACTTGCACTTGTGAAGCCACGACAGAGCATGACCCTCAGCTCAACTTTCACCAATAAA 1680
I I I I II I II I I I I II I M M I I I I I I I I I I I M II I I I I I I I I I I I I I I I I I
Db 1621 GAACTTGCACCTGTGAAACCTCGGCAGAGCCTAACCCTCAGTTCAACTTTCACCAATAAG 1680

Qy 1681 GAGGCCTTCCTTGATGTTGATTCCAGTCCAGAAGGGTCTGGGACTGAAGATAATTTACAG 1740
I I I I II M I I II II I II M I I I I I I I II M I I M I I t I I I I I I I I II I I I Mill
Db 1681 GAGGCCCTCCTTGATGTTGATTCCAGTCCGGAGGGGTCTGGGACTGAAGATAACTTACAA 1740

Qy 1741
I I
Db 1741



RESULT 12 
BD012720 
LOCUS BD012720 1743 bp DNA linear PAT 02-AUG-2002
DEFINITION High-affinity choline transporter.
ACCESSION BD012720
VERSION BD012720.1 GI:22092909
KEYWORDS WO 0116315-A/4.
SOURCE Mus musculus (house mouse)
ORGANISM Mus musculus
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
REFERENCE 1 (bases 1 to 1743)
AUTHORS Haga,T. and Okuda,T.
TITLE High-affinity choline transporter
JOURNAL Patent: WO 0116315-A 4 08-MAR-2001;
JAPAN SCIENCE AND TECHNOLOGY CORP,TATSUYA HAGA.TAKASHI OKUDA
COMMENT OS Mus musculus (mouse)
PN WO 0116315-A/4
PD 08-MAR-2001
PF 18-AUG-2000 WO 2000JP005545
PR 27-AUG-1999 JP 99P 240642, 27-DEC-1999 JP 99P 368991
PI TATSUYA HAGA.TAKASHI OKUDA
PC C12N15/12,C07K14/47,C12Q1/68,C07K19/00,C07K16/18,C12N5/10,
PC A61K38/17,
PC A61K45/00,A61P25/28,G01N33/53,A01K67/027
CC
FH Key Location/Qualifiers
FT CDS (1)..(1743).
FEATURES Location/Qualifiers
source 1..1743
/organism="Mus musculus"
/mol_type="genomic DNA"
/db_xref="taxon:10090"

ORIGIN 

Query Match 78.8%; Score 1373.4; DB 6; Length 1743; 

Best Local Similarity 86.7%; Pred. No. 0; 

Matches 1512; Conservative 0; Mismatches 231; Indels 0; Gaps 0; 

Qy 1 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60 

III I I II I I I M I I I I II I I I M III I I M I I I M II II 

Db 1 ATGTCTTTCCACGTAGAAGGACTGGTAGCTATTATCCTCTTCTACCTCCTTATATTTCTG 60 

Qy 61 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 120

I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I M I I I I I I I M M I II 111 III 

Db 61 GTTGGAATATGGGCTGCATGGAAAACCAAAAACAGCGGCAACCCAGAAGAGCACAGTGAA 120

Qy 121 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 180

I I II I M I I I I II Mill II I I I I I I I I I I I I M I I I I I I I M M I I I I I I I I 
Db 121 GCCATCATAGTCGGGGGCCGTGACATTGGTTTGTTGGTTGGTGGTTTTACCATGACAGCC 180 

Qy 181 ACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTAT 240 

I I I I I I I I I I I I II M II I I I I I I I I I I I I I M M I I I I I I I I I I II I I I I 

Db 181 ACCTGGGTTGGAGGAGGCTACATCAATGGGACAGCAGAAGCAGTGTATGGGCCAGGTTGT 240

Qy 241 GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 300 

II I M I I I I I I II M I I II I I I I I II I I I I I I I I I I I I I I I I I I I M I I I I I III 

Db 241 GGTCTAGCTTGGGCTCAGGCACCCATTGGATATTCTCTGAGTCTAATTTTAGGTGGTCTG 300

Qy 301 TTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAA 360 

II I I I I I I I II I I I I I I I I I I Mill II M I I I I I I II I II I I I I I II M II 
Db 301 TTTTTTGCGAAACCTATGCGTTCCAAGGGATATGTGACTATGTTAGACCCATTTCAACAG 360 

Qy 361 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTC 420 

II I II I II II I I II II I I I II II II II II I I M II II II II I II I I II M II 
Db 361 ATCTATGGAAAGCGCATGGGTGGGCTGCTCTTCATCCCTGCACTGATGGGAGAGATGTTC 420 

Qy 421 TGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGAT 480 

I II II I II M I II I I I II II I II II II II I I I I I I I M II I M I I I I I II M II I I 
Db 421 TGGGCTGCAGCAATTTTCTCTGCATTAGGGGCCACCATCAGCGTGATCATTGATGTGGAT 480

Qy 481 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG 540

II II II II I II I I II I I I I II I I M M I M I III II II II I II II III 
Db 481 GTGAACATATCGGTCATTGTCTCTGCACTCATTGCCATTCTTTATACCCTAGTGGGTGGG 540 

Qy 541 CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGG 600 

Mill II M I M I II I M I M II M Mill II II I M II M MM II II II 

Db 541 CTCTACTCTGTGGCATATACTGATGTTGTCCAGCTATTCTGCATTTTTATAGGACTGTGG 600 



Qy 601 ATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTG 660

II II I II II I I II II I II I II II I M M M M I II II I I II Mill MUM

Db 601 ATCAGTGTCCCTTTTGCCCTGTCACATCCTGCAGTCACCGACATCGGATTCACAGCTGTG 660



Qy 661 CATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGG 720 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I III I I I I I M I II I III 

Db 661 CATGCTAAATACCAGAGTCCCTGGCTGGGAACCATTGAATCAGTTGAAGTCTACACCTGG 720 

Qy 721 CTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGG 780 

M III II II I I I M II I I I M M I I I I I I I I I I I II M I I I II II Mill I I I I I I 
Db 721 CTTGATAATTTTCTGTTATTGATGCTGGGTGGAATCCCATGGCAAGCCTACTTCCAGAGG 780 

Qy 781 GTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGG 840

II II I I I I I I I I I I I I I I I I I I I I II I I II II II I I I I I I I I M I I I I I I I I III 

Db 781 GTCCTCTCTTCATCCTCAGCCACCTATGCTCAGGTACTGTCCTTCCTGGCAGCTTTTGGG 840

Qy 841 TGCCTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGG 900 

I I II I II I I I I I II I II M I I II III M II I I M I I M M I I I I I I I I I 

Db 841 TGCCTGGTGATGGCTCTACCCGCCATATGCATAGGAGCTATTGGAGCTTCCACAGACTGG 900 

Qy 901 AACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTA 960 

M II II II M I II III I I I I I I I I I I I I I I I I I II II I I I M I I I I I I I I 
Db 901 AACCAGACTGCCTACGGGTATCCAGATCCCAAGACTAAGGAGGAAGCAGACATGATTCTC 960 

Qy 961 CCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTT 1020 

II II I I I I I M I I II I I I I II I I I I I I I I II II I I I I I I M I I I I II M III 

Db 961 CCGATCGTTCTGCAGTACCTCTGCCCTGTGTACATCTCCTTCTTTGGGCTTGGTGCTGTT 1020 

Qy 1021 TCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCA 1080

II I I I I M I I I I I I I Mill II M II II II II II II II II I II II M II 

Db 1021 TCAGCTGCTGTCATGTCCTCAGCTGACTCGTCCATCCTGTCGGCGAGTTCTATGTTTGCT 1080

Qy 1081 CGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTT 1140 

Mill I I II I I I I I I I II I M I I II II M I I I II II I I II I I II I I M II I II 
Db 1081 CGGAATATCTACCAGCTTTCCTTCAGACAAAATGCATCAGACAAGGAAATTGTGTGGGTC 1140 

Qy 1141 ATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAA 1200 

III I I I I I I III I II II II II I I I I M II II I II II I II II I II I I II I II II 

Db 1141 ATGAGGATCACTGTGCTTGTGTTCGGAGCATCTGCAACAGCCATGGCTTTGCTGACGAAG 1200 

Qy 1201 ACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAG 1260 

M II I M I II I I M I I I M I M I II I II I II I I I I I MUM I M I I I I M III 

Db 1201 ACTGTGTATGGGCTCTGGTACCTGAGCTCTGACCTTGTCTACATCATCATCTTCCCACAG 1260 

Qy 1261 CTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTT 1320

II II I I I I I II I I I I I I M I I I I I I i I I II I M I I I I I II II M I II I M 

Db 1261 CTGCTCTGTGTACTCTTCATCAAAGGAACCAACACTTATGGGGCAGTTGCTGGTTATATT 1320 

Qy 1321 TCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATC 1380 

I Ml II I I II I M M II I II I I I I I I M I I I II I M II I Mill Mi I I I 

Db 1321 TTTGGACTATTCCTGAGAATTACTGGAGGAGAGCCATATCTATACTTGCAGCCCTTAATC 1380 

Qy 1381 TTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAATTTCCATTTAAA 1440

M I I II I I M I II II I I II II I I II II II II I I I I I M I I II I II II I M I 

Db 1381 TTCTACCCTGGTTATTACTCTGACAAGAATGGTATATACAATCAGAGGTTCCCATTTAAA 1440 

Qy 1441 ACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTAT 1500 

M II II II II I M I II M II M I II M II I I I I I II II II II II II M II I 

Db 1441 ACTCTCTCCATGGTTACCTCATTCTTTACCAACATTTGTGTTTCTTATCTAGCCAAGTAT 1500



Qy 1501 CTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGA 1560



I I M I I I I I I I M I I I I I M I I I II I I I I I I I I I I I I M I I I I I I I I I I I I Mill 

Db 1501 CTATTTGAAAGTGGAACCTTGCCTCCAAAATTAGATGTATTTGATGCTGTTGTCGCAAGG 1560 

Qy 1561 CACAGTGAAGAAAACATGGATAAGACAATTCTTGTCAAAAATGAAAATATTAAATTAGAT 1620 

M I II I I I I I I I M I II II I II I I I I I I I I I I I I I I I I I I I M I I I II I I I I I 

Db 1561 CACAGTGAAGAGAACATGGACAAGACCATTCTAGTCAGAAATGAAAATATCAAATTAAAT 1620 

Qy 1621 GAACTTGCACTTGTGAAGCCACGACAGAGCATGACCCTCAGCTCAACTTTCACCAATAAA 1680

I II II I I I I I I I M II M II I II I I I I I M I I I I I I I I I I I I I II I II I I I I 
Db 1621 GAACTTGCACCTGTGAAACCTCGGCAGAGCCTAACCCTCAGTTCAACTTTCACCAATAAG 1680 

Qy 1681 GAGGCCTTCCTTGATGTTGATTCCAGTCCAGAAGGGTCTGGGACTGAAGATAATTTACAG 1740 

I I I I I I I I I M I M II I I I M I I I M M II I I I I I I I I I I I I I I I I I I I I I II II 

Db 1681 GAGGCCCTCCTTGATGTTGATTCCAGTCCGGAGGGGTCTGGGACTGAAGATAACTTACAA 1740 

Qy 1741 TGA 1743 

I I I 

Db 1741 TGA 1743 
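
The percentage fields in each result summary can be cross-checked against the raw counts. The sketch below is an illustration only, not part of the search program's output. It assumes that Query Match is the score divided by the perfect score of 1743 given in the run header, and that Best Local Similarity is the number of matches divided by the number of aligned columns. The function name `summarize_hit` and its parameters are hypothetical.

```python
def summarize_hit(score, matches, mismatches, perfect_score,
                  conservative=0, indels=0):
    """Recompute the two percentage fields of one result summary.

    Assumptions (not stated by the report itself):
      Query Match           = score / perfect_score
      Best Local Similarity = matches / aligned columns
    """
    # Every aligned column is a match, a conservative change,
    # a mismatch, or an indel.
    aligned_columns = matches + conservative + mismatches + indels
    return {
        "query_match_pct": round(100.0 * score / perfect_score, 1),
        "best_local_similarity_pct": round(100.0 * matches / aligned_columns, 1),
    }


if __name__ == "__main__":
    # Figures taken from the summary lines of Result 12.
    print(summarize_hit(score=1373.4, matches=1512, mismatches=231,
                        perfect_score=1743))
```

Under these assumptions the Result 12 figures (score 1373.4, 1512 matches, 231 mismatches) give 78.8% and 86.7%, which agrees with the reported summary lines.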



RESULT 13 

AX080443 

LOCUS AX080443 4938 bp DNA linear PAT 22-FEB-2001
DEFINITION Sequence 1 from Patent WO0078950.
ACCESSION AX080443
VERSION AX080443.1 GI:13159872
KEYWORDS
SOURCE Mus musculus (house mouse)
ORGANISM Mus musculus
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus.
REFERENCE 1
AUTHORS Sierzega,M. and Albrandt,K.
TITLE Differentially expressed genes in the adipocytes of obese mice
JOURNAL Patent: WO 0078950-A 1 28-DEC-2000;
AMYLIN PHARMACEUTICALS, INC. (US)
FEATURES Location/Qualifiers
source 1..4938
/organism="Mus musculus"
/mol_type="unassigned DNA"
/db_xref="taxon:10090"
/note="P4P6Bl"



ORIGIN 



Query Match 78.8%; Score 1373.4; DB 6; Length 4938;

Best Local Similarity 86.7%; Pred. No. 0;

Matches 1512; Conservative 0; Mismatches 231; Indels 0; Gaps 0;



Qy 1 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60
I II I I II II I II I I I II I I I I M I I I I I III I I I I I I I I I II II I III
Db 247 ATGTCTTTCCACGTAGAAGGACTGGTAGCTATTATCCTCTTCTACCTCCTTATACTTCTG 306

Qy 61 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 120
I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I M M III
Db 307 GTTGGAATATGGGCTGCATGGAAAACCAAAAACAGCGGCAACCCAGAAGAGCGCAGTGAA 366

Qy 121 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 180
Db 367 GCCATCATAGTCGGGGGCCGTGACATTGGTTTGTTGGTTGGTGGTTTTACCATGACAGCC 426
181 ACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTAT 240

I I M I I I I I I I I I I I I II I I I I I I I I I I I I I M I I M I I I I I I I I I I I I I I 

427 ACCTGGGTTGGAGGAGGCTACATCAATGGGACAGCAGAAGCAGTGTATGGGCCAGGTTGT 486 

241 GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 300 

II I I I I II I I I I I II I I M I I I I M I I I II I I I I I I II I II I I M I II I I II III 

487 GGTCTAGCTTGGGCTCAGGCACCCATTGGATATTCTCTGAGTCTAATTTTAGGTGGTCTG 546 

301 TTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAA 360

II I I I I I I I I I I II I II I I I I Mill I I M I I I I I II II I I I I I I II I II II 

547 TTTTTTGCGAAACCTATGCGTTCCAAGGGATATGTGACTATGTTAGACCCATTTCAACAG 606 

361 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTC 420 

I I I I I I I I I I I I I I I I I I I M II II M II I I I I I II I M I I I I I I I I I I II I 
607 ATCTATGGAAAGCGCATGGGTGGGCTGCTCTTCATCCCTGCACTGATGGGAGAGATGTTC 666 

421 TGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGAT 480 

I I I I I I I I I I I I I I I I I I I I I II II II I I I I I I I I I M I I I I I I I I I I II I I I II I 

667 TGGGCTGCAGCAATTTTCTCTGCATTAGGGGCCACCATCAGCGTGATCATTGATGTGGAT 726

481 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG 540 

II I I I I II II I I I I I II II I I I I I I I I I I I I III II II II I I I I I III 
727 GTGAACATATCGGTCATTGTCTCTGCACTCATTGCCATTCTTTATACCCTAGTGGGTGGG 786

541 CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGG 600 
Mill II I I II II II I II II I II II Mill II I II I I II II I I II II I I II 

787 CTCTACTCTGTGGCATATACTGATGTTGTCCAGCTATTCTGCATTTTTATAGGACTGTGG 846

601 ATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTG 660 

II II I I I I I I I I II I I I I II M I II I M I I I I I I I II II I I II I I I II II I I 

847 ATCAGTGTCCCTTTTGCCCTGTCACATCCTGCAGTCACCGACATCGGATTCACAGCTGTG 906 

661 CATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGG 720 

I I II I I I I I II I I I II II I II I I II II I II I III II M M I II I I III 

907 CATGCTAAATACCAGAGTCCCTGGCTGGGAACCATTGAATCAGTTGAAGTCTACACCTGG 966 

721 CTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGG 780 

II I II II I II I I I M I II II II II I II I M I I I I I I I II M II I I I II I I I I II I I 

967 CTTGATAATTTTCTGTTATTGATGCTGGGTGGAATCCCATGGCAAGCCTACTTCCAGAGG 1026 

781 GTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGG 840

II II I I I I I I I I I II I I I I I I I II I II I I I II I II I I II II I II I M II II I III 
1027 GTCCTCTCTTCATCCTCAGCCACCTATGCTCAGGTACTGTCCTTCCTGGCAGCTTTTGGG 1086 

841 TGCCTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGG 900 

II II I II I I II I I I I M I I II I I III II II I I II M I I II I I I II II I I 
1087 TGCCTGGTGATGGCTCTACCCGCCATATGCATAGGAGCTATTGGAGCTTCCACAGACTGG 1146 

901 AACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTA 960 

I I I II I I I I M II III M II II II I I M I I II I M M I II I I I I I II II I 

1147 AACCAGACTGCCTACGGGTATCCAGATCCCAAGACTAAGGAGGAAGCAGACATGATTCTC 1206

961 CCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTT 1020 

II II II I II I I I I I I II II II II I II I II II II I I II I I I I I I I II M I III 



Db 1207 CCGATCGTTCTGCAGTACCTCTGCCCTGTGTACATCTCCTTCTTTGGGCTTGGTGCTGTT 1266 

Qy 1021 TCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCA 1080

II M II M II Mill Mill II II i II II I MM II I M II II I I M I I 

Db 1267 TCAGCTGCTGTCATGTCCTCAGCTGACTCGTCCATCCTGTCGGCGAGTTCTATGTTTGCT 1326 

Qy 1081 CGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTT 1140 

Mill II I M II II M II M II M M M M II II II Mill II II I II I I I II 

Db 1327 CGGAATATCTACCAGCTTTCCTTCAGACAAAATGCATCAGACAAGGAAATTGTGTGGGTC 1386

Qy 1141 ATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAA 1200

III I II II I III M I I II I II II I M I II II M I II I II II I II II II II I II 

Db 1387 ATGAGGATCACTGTGCTTGTGTTCGGAGCATCTGCAACAGCCATGGCTTTGCTGACGAAG 1446 

Qy 1201 ACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAG 1260 

II M II II II II II I II II I II I II II II II I M II [Mill I II II M II III 

Db 1447 ACTGTGTATGGGCTCTGGTACCTGAGCTCTGACCTTGTCTACATCATCATCTTCCCACAG 1506 

Qy 1261 CTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTT 1320

II I II M I M I M I M I II M I II I II I I I I M I I II I II II II II M II 
Db 1507 CTGCTCTGTGTACTCTTCATCAAAGGAACCAACACTTATGGGGCAGTTGCTGGTTATATT 1566 

Qy 1321 TCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATC 1380 

I III II II II II II II I I II I II M II M II II I M M I II II II II Ml 

Db 1567 TTTGGACTATTCCTGAGAATTACTGGAGGAGAGCCATATCTATACTTGCAGCCCTTAATC 1626

Qy 1381 TTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAATTTCCATTTAAA 1440 

I II I II I I I II I I II I I I II I I II I II II M I I M II I I I II I II II I I I I 

Db 1627 TTCTACCCTGGTTATTACTCTGACAAGAATGGTATATACAATCAGAGGTTCCCATTTAAA 1686 

Qy 1441 ACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTAT 1500 

II II I II M II II I M II M II II II II M II I I II M I M II II II I I M 

Db 1687 ACTCTCTCCATGGTTACCTCATTCTTTACCAACATTTGTGTTTCTTATCTAGCCAAGTAT 1746

Qy 1501 CTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGA 1560 

II II II II II I II M II M II M II II II II II II II M II II II I II II I Mill 
Db 1747 CTATTTGAAAGTGGAACCTTGCCTCCAAAATTAGATGTATTTGATGCTGTTGTCGCAAGG 1806 

Qy 1561 CACAGTGAAGAAAACATGGATAAGACAATTCTTGTCAAAAATGAAAATATTAAATTAGAT 1620 

II II II M II I M II II II II I II II II I I I II I II II II II II I I II M I II 

Db 1807 CACAGTGAAGAGAACATGGACAAGACCATTCTAGTCAGAAATGAAAATATCAAATTAAAT 1866

Qy 1621 GAACTTGCACTTGTGAAGCCACGACAGAGCATGACCCTCAGCTCAACTTTCACCAATAAA 1680 

II M II II II II II II M II Mini I II II M II II M II I M M M M II 

Db 1867 GAACTTGCACCTGTGAAACCTCGGCAGAGCCTAACCCTCAGTTCAACTTTCACCAATAAG 1926 

Qy 1681 GAGGCCTTCCTTGATGTTGATTCCAGTCCAGAAGGGTCTGGGACTGAAGATAATTTACAG 1740 

II II II I I II II II II M II II I II M I M II II II II M I II II II II I MM! 
Db 1927 GAGGCCCTCCTTGATGTTGATTCCAGTCCGGAGGGGTCTGGGACTGAAGATAACTTACAA 1986 

Qy 1741 TGA 1743 

I I I 

Db 1987 TGA 1989 
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MMU401467 1743 bp mRNA linear ROD 16-AUG-2000 

Mus musculus itiRNA for high affinity choline transporter (CHTl 
gene) . 
AJ401467 

AJ401467.1 GI:9843808 

ChTl gene; high affinity choline transporter. 
Mus musculus (house mouse) 

Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus - 
1 

Wieland,A., Bonisch,H. and Bruss,M.

Molecular cloning of the human and murine high affinity choline 

transporters and characterization of the human gene-structure

Unpublished 

2 (bases 1 to 1743) 

Bruess M. 

Direct Submission 

Submitted (14-AUG-2000) Bruess M., University of Bonn, Pharmacology
and Toxicology, Reuterstr. 2b, D-53113 Bonn, GERMANY

Location/Qualifiers 

1..1743

/organism="Mus musculus" 

/mol_type="mRNA" 

/strain="BALB/cJ" 

/db_xref="taxon:10090"

/tissue_type="brainstem" 

1..1743

/gene="CHT1"

1..1743

/gene="CHT1"

/function="sodium- and chloride-dependent reuptake of 

choline" 

/codon_start=1

/evidence=experimental

/product="high affinity choline transporter"
/protein_id="CAC03719.1"
/db_xref="GI:9843809"
/db_xref="GOA:Q9ESW5"
/db_xref="SPTREMBL:Q9ESW5"

/translation="MSFHVEGLVAIILFYLLIFLVGIWAAWKTKNSGNPEEHSEAIIV
GGRDIGLLVGGFTMTATWVGGGYINGTAVAVYGPGCGLAWAQAPIGYSLSLILGGLFF
AKPMRSKGYVTMLDPFQQIYGKRMGGLLFIPALMGEMFWAAAIFSALGATISVIIDVD
VNISVIVSALIAILYTLVGGLYSVAYTDVVQLFCIFIGLWISVPFALSHPAVTDIGFT
AVHAKYQSPWLGTIESVEVYTWLDNFLLLMLGGIPWQAYFQRVLSSSSATYAQVLSYL
AAFGCLVMALPAICIGAIGASTDWNQTAYGYPDPKTKEEADMILPIVLQYLCPVYISF
FGLGAVSAAVMSSADSSILSASSMFARNIYQLSFRQNASDKEIVWVMRITVLVFGASA
TAMALLTKTVYGLWYLSSDLVYIIIFPQLLCVLFIKGTNTYGAVAGYIFGLFLRITGG
EPYLYLQPLIFYPGYYSDKNGIYNQRFPFKTLSMVTSFFTNICVSYLAKYLFESGTLP
PKLDVFDAVVARHSEENMDKTILVRNENIKLNELAPVKPRQSLTLSSTFTNKEALLDV
DSSPEGSGTEDNLQ"



ORIGIN 



Query Match 78.4%; Score 1367; DB 10; Length 1743; 

Best Local Similarity 86.5%; Pred. No. 0; 

Matches 1508; Conservative 0; Mismatches 235; Indels 0; Gaps 0; 



Qy 1 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60

III II I I II I II I I I I I I II I I II I M I III I I I I I II I I II II II III 

Db 1 ATGTCTTTCCACGTAGAAGGACTGGTAGCTATTATCCTCTTCTACCTCCTTATATTTCTG 60 

Qy 61 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 120 

II II I II II I II I I II I I I I I II I I I M I I I I I I I I I I I I I I I I I I I III III 

Db 61 GTTGGAATATGGGCTGCATGGAAAACCAAAAACAGCGGCAACCCAGAAGAGCACAGTGAA 120 

Qy 121 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 180

II I I I I M I I I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I 

Db 121 GCCATCATAGTCGGGGGCCGTGACATTGGTTTGTTGGTTGGTGGTTTTACCATGACAGCC 180 

Qy 181 ACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTAT 240 

I I M II I I M II I I I I II I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I 

Db 181 ACCTGGGTTGGAGGAGGCTACATCAATGGGACAGCAGTAGCAGTGTATGGGCCAGGTTGT 240 

Qy 241 GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 300 

II I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I III 

Db 241 GGTCTAGCTTGGGCTCAGGCACCCATTGGATATTCTCTGAGTCTAATTTTAGGTGGTCTG 300 

Qy 301 TTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAA 360 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I II 

Db 301 TTTTTTGCGAAACCTATGCGTTCCAAGGGATATGTGACTATGTTAGACCCATTTCAACAG 360 

Qy 361 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTC 420

I I I I I I II I II I I I II I II II II II II II I I I I I I I I I I I I I I I II MINI 

Db 361 ATCTATGGAAAGCGCATGGGTGGGCTGCTCTTCATCCCTGCACTGATGGGAGAGATGTTC 420 

Qy 421 TGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGAT 480 

I I M I I I I I M 11 I I I II I II I I II II I I I I I I I I I I I I I I I I II I I I I I I I I I I I 

Db 421 TGGGCTGCAGCAATTTTCTCTGCATTAGGGGCCACCATCAGCGTGATCATTGATGTGGAT 480 

Qy 481 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG 540

II I I I I II I I I I I I I I I I I II I I I I I I I I I III M II II I II I I III 

Db 481 GTGAACATATCGGTCATTGTCTCCGCACTCATTGCCATTCTTTATACCCTAGTGGGTGGG 540 

Qy 541 CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGG 600 

I I I II I I M I I I I II I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I I 

Db 541 CTCTACTCTGTGGCATATACTGATGTTGTCCAGCTATTCTGCATTTTTATAGGACTGTGG 600 

Qy 601 ATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTG 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M II II I I II II II II II 

Db 601 ATCAGTGTCCCTTTTGCCCTGTCACATCCTGCAGTCACCGACATCGGATTCACAGCTGTG 660 

Qy 661 CATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGG 720

I I I I I I I I I I I I i ' I II I I I I I I I I I I I I I I I III I I I I I I I I I I I III 

Db 661 CATGCTAAATACCAGAGTCCCTGGCTGGGAACCATTGAATCAGTTGAAGTCTACACCTGG 720 

Qy 721 CTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGG 780

I I I I I I I I I I I I I I I I M I I I I II I II I I I I II I I I I I I I I I M I I I I I I I I I I I I 

Db 721 CTTGATAATTTTCTGTTATTGATGCTGGGTGGAATCCCATGGCAAGCCTACTTCCAGAGG 780 

Qy 781 GTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGG 840 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I M I I I I I I I I I I I I I III 

Db 781 GTCCTCTCTTCATCCTCAGCCACCTATGCTCAGGTACTGTCCTACCTGGCAGCTTTTGGG 840 

Qy 841 TGCCTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGG 900 



Db 841 TGCCTGGTGATGGCTCTACCCGCCATATGCATAGGAGCTATTGGAGCTTCCACAGACTGG 900 

Qy 901 AACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTA 960

I I I I I I I I I I I II III I I I II II I I II I II I II II II I II M I I I I I II I 

Db 901 AACCAGACTGCCTACGGGTATCCAGATCCCAAGACTAAGGAGGAAGCAGACATGATTCTC 960 

Qy 961 CCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTT 1020 

II II I II M II II II I I I I I I I I I I I I II M II I I I I I I II I I I I I I I I III 

Db 961 CCGATCGTTCTGCAGTACCTCTGCCCTGTGTACATCTCCTTCTTTGGGCTTGGTGCTGTT 1020 

Qy 1021 TCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCA 1080 

M I I I I I I M I I I I I I I I M II II I I I I I I MM II I I I II M II I I M 

Db 1021 TCAGCTGCTGTCATGTCCTCAGCTGACTCGTCCATCCTGTCGGCGAGTTCTATGTTTGCT 1080

Qy 1081 CGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTT 1140 

II I I I M I I II M I I II M I II I I II I II M I I I II Mill I II II II II II I 

Db 1081 CGGAATATCTACCAGCTTTCCTTCAGACAAAATGCATCAGACAAGGAAATTGTGTGGGTC 1140

Qy 1141 ATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAA 1200

III I II I I I III II I I I I I II II I II I II I I II II II I II II II II I I I I M I 

Db 1141 ATGAGGATCACTGTGCTTGTGTTCGGAGCATCTGCAACAGCCATGGCTTTGCTGACGAAG 1200

Qy 1201 ACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAG 1260 

II II II II I M M I II I II I II II I M M II I II I II I II I I II I II II I III 

Db 1201 ACTGTGTATGGGCTGTGGTACCTGAGCTCTGACCTTGTCTACATCATCATCTTCCCACAG 1260 

Qy 1261 CTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTT 1320 

I II II II I II II M M I II I I I M II II II II II II I I M II I I I I I I II 
Db 1261 CTGCTCTGTGTACTCTTCATCAAAGGAACCAACACTTATGGGGCAGTTGCTGGTTATATT 1320 

Qy 1321 TCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATC 1380 

I III II I I II II II I I I II II II II II II I I M II I II I II I M M I III 

Db 1321 TTTGGACTATTCCTGAGAATTACTGGAGGAGAGCCATATCTATACTTGCAGCCCTTAATC 1380 

Qy 1381 TTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAATTTCCATTTAAA 1440

II I II II II M I I I M I MM I II I M II I I II II II II I M I I II I I M I 

Db 1381 TTCTACCCTGGTTATTACTCTGACAAGAATGGTATATACAATCAGAGGTTCCCATTTAAA 1440 

Qy 1441 ACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTAT 1500 

II II II II M M II II I M II I II I II II I I II I II I II II II I M II II I 

Db 1441 ACTCTCTCCATGGTTACCTCATTCTTTACCAACATTTGTGTTTCTTATCTAGCCAAGTAT 1500 

Qy 1501 CTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGA 1560 

II II II I II II I M I II II II M M II II I II I M M II I M II II II I I I MM! 

Db 1501 CTATTTGAAAGTGGAACCTTGCCTCCAAAATTAGATGTATTTGATGCTGTTGTCGCAAGG 1560 

Qy 1561 CACAGTGAAGAAAACATGGATAAGACAATTCTTGTCAAAAATGAAAATATTAAATTAGAT 1620 

II M II II II I M II II II Mill Mill II I I M II II II II II I I II II II 

Db 1561 CACAGTGAAGAGAACATGGACAAGACCATTCTAGTCAGAAATGAAAATATCAAATTAAAT 1620

Qy 1621 GAACTTGCACTTGTGAAGCCACGACAGAGCATGACCCTCAGCTCAACTTTCACCAATAAA 1680 

I I I II I I M I MUM II II MUM I II I II I I I II II I II II II I II I II 

Db 1621 GAACTTGCACCTGTGAAACCTCGGCAGAGCCTAACCCTCAGTTCAACTTTCACCAATAAG 1680 

Qy 1681 GAGGCCTTCCTTGATGTTGATTCCAGTCCAGAAGGGTCTGGGACTGAAGATAATTTACAG 1740

M M II II I II M II II II I M M I II I M II II II II I I I M M II I M II I M 



Db 1681 GAGGCCCTCCTTGATGTTGATTCCAGTCCGGAGGGGTCTGGGACTGAAGATAACTTACAA 1740



Qy 1741 TGA 1743 

I I I 

Db 1741 TGA 1743 
TMA420808 2528 bp mRNA linear VRT 27-NOV-2001

Torpedo marmorata mRNA for high affinity choline transporter (CHT1
gene).
AJ420808 

AJ420808.1 GI:17148508

CHT1 gene; high affinity choline transporter.
Torpedo marmorata (marbled electric ray) 
Torpedo marmorata 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Chondrichthyes;
Elasmobranchii; Squalea; Hypnosqualea; Pristiorajea; Batoidea;
Torpediniformes; Torpedinoidei; Torpedinidae; Torpedo.
1 

Guermonprez,L., O'Regan,S., Meunier,F.M. and
Morot-Gaudry-Talarmain,Y.

Cyclosporin, FK506 and rapamycin inhibit neuronal choline uptake 

via calcineurin-dependent and independent mechanisms 

Unpublished 

2 (bases 1 to 2528) 

O'Regan,S.

Direct Submission 

Submitted (21-NOV-2001) O'Regan S., Neurobiologie Cellulaire et
Moleculaire, C.N.R.S., 1 av de la Terrasse, F-91198 Gif-sur-Yvette, 
FRANCE 

Location/Qualifiers 
1..2528

/organism="Torpedo marmorata" 

/mol_type="mRNA" 

/db_xref="taxon:7788"

/clone="tH312" 

/tissue_type="electric lobe" 

/tissue_lib="lambda ZAPII ELL" 

1..2528

/gene="CHT1"

49..1803

/gene="CHT1"

/function="neuronal Na-dependent choline transporter" 

/codon_start=l 

/evidence=experimental 

/product="high affinity choline transporter" 
/protein_id="CAD12727.1"
/db_xref="GI:17148509"
/db_xref="GOA:Q8UWF0"
/db_xref="SPTREMBL:Q8UWF0"

/translation="MTVHIDGIVAIVLFYLLILFVGLWAAWKSKNTSMEGAMDRSEAI
MIGGRDIGLLVGGFTMTATWVGGGYINGTAEAVYVPGYGLAWAQAPFGYALSLVIGGL
FFAKPMRSRGYVTMLDPFQQMYGKRMGGLLFIPALLGEIFWSAAILSALGATLSVIVD
ININVSVVVSAVIAVLYTLVGGLYSVAYTDVVQLFCIFLGLWISIPFALLNPAVTDII
VTANQEVYQEPWVGNIQSKDSLIWIDNFLLLMLGGIPWQVYFQRVLSASSATYAQVLS
FLAAFGCVLMAIPSVLIGAIGTSTDWNQTSYGLPGPIGKNETDMILPIVLQHLCPPYI
SFFGLGAVSAAVMSSADSSILSASSMFARNIYHLAFRQEASDKEIVWVMRITIFLFGG
AATSMALLAQSIYGLWYLSSDLVYVIIFPQLISVLFVKGTNTYGSIAGYIIGFLLRIS
GGEPYLHMQPFIYYPGCYLDHSFGDDPVYVQRFPFKTMAMLFSFLGNTGVSYLVKYLF
VSGILPPKLDFLDSVVSKHSKEIMDKTFLMNQDNITLSELVHVNPIHSASVSAALTNK
EAFEDIEPNPELSKSGND"
polyA_signal 2487..2492
/gene="CHT1"

ORIGIN 

Query Match 49.7%; Score 867; DB 5; Length 2528; 

Best Local Similarity 69.3%; Pred. No. 2.9e-227; 

Matches 1217; Conservative 0; Mismatches 520; Indels 18; Gaps 2; 
Qy 1 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60 

I I I I Ml Mill I I M M I M II I I II I II II I I I I I 

Db 49 ATGACCGTTCACATCGATGGGATCGTAGCGATCGTCCTGTTTTACTTGTTAATCTTATTT 108

Qy 61 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCA------GCGCAGAAGAGCGC 114

I III I M I I I II I II II I II I I I I II I I I I II I I I II I 

Db 109 GTTGGATTATGGGCTGCTTGGAAAAGTAAAAACACGTCAATGGAAGGAGCAATGGATCGG 168 

Qy 115 AGCGAAGCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATG 174 

I I I I I I M I II I I I II I I I I I I I II I I II I I II I I I I I M II I I 

Db 169 AGTGAAGCTATAATGATTGGGGGAAGAGATATCGGGCTGCTGGTTGGTGGCTTCACAATG 228 

Qy 175 ACAGCTACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCA 234 

II II II I I I I I I I I II II II I I II I I I I I I I I I I II II II I I I I II II 

Db 229 ACCGCAACTTGGGTCGGTGGCGGTTATATCAATGGGACAGCAGAGGCGGTTTATGTTCCT 288 

Qy 235 GGTTATGGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGT 294 

I II I I I I II II I I I M I I I II I MINI M II I I I II II I I 

Db 289 GGGTACGGCTTGGCCTGGGCGCAGGCTCCCTTCGGATACGCACTCAGCCTGGTTATTGGC 348

Qy 295 GGCCTGTTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTT 354 

III I II II II I I II I Mill III III II I I I I II I II I I I I I I I I I I 

Db 349 GGCTTATTTTTCGCTAAACCCATGCGCTCACGGGGTTACGTGACCATGCTGGACCCGTTT 408 

Qy 355 CAGCAAATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAA 414 

II II II II II I I I I I I II I I III I II II II II II II I I I I Ml 

Db 409 CAACAGATGTACGGTAAACGAATGGGAGGATTGCTCTTCATCCCCGCTCTCCTGGGGGAA 468

Qy 415 ATGTTCTGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGAT 474 

I M II II II II I II II I Mill I II I II I II II II I II III 

Db 469 ATCTTCTGGTCTGCAGCCATACTGTCCGCGCTAGGTGCAACTTTAAGCGTGATTGTGGAC 528 

Qy 475 GTGGATATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTG 534 

I I I II I I I I II I I I II I I II II I I I M M II M I 

Db 529 ATCAATATAAACGTATCAGTGGTAGTTTCCGCTGTGATCGCTGTATTATACACTCTGGTC 588

Qy 535 GGAGGGCTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGG 594 

II III I II II M M Mill II I II II I III I II I II II I II II I I 

Db 589 GGCGGGTTATACTCGGTCGCGTACACAGATGTCGTCCAGTTGTTTTGCATCTTCTTAGGT 648 

Qy 595 CTGTGGATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACT 654 

I I I I II I I II II I M II I I I M I I I I I I I I Mill Ml 

Db 649 TTGTGGATCAGCATCCCTTTCGCCCTCTTAAATCCCGCTGTTACGGATATTATCGTGACC 708 



Qy 655 GCTGTGCATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTAC 714 

M III II II I I I I I I I M I I I I I I I I I I II I 

Db 709 GCAAATCAAGAAGTTTATCAGGAGCCTTGGGTGGGAAATATACAATCAAAGGACAGTTTA 768

Qy 715 TCTTGGCTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTT 774 

III I I I I I I I M I II M I II M I I II I I M I I I I I I I I II I III Ml 
Db 769 ATCTGGATTGACAACTTTCTATTACTGATGCTGGGTGGAATCCCGTGGCAAGTATATTTT 828 

Qy 775 CAGAGGGTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCT 834 

I I I I I II II III I I I I II II I I I I I I I I II II I I I I I I I I I I I M I II 
Db 829 CAGAGAGTCCTTTCTGCTTCTTCTGCTACCTATGCGCAAGTCCTGTCCTTTCTGGCTGCC 888 

Qy 835 TTCGGGTGCCTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACA 894 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I 
Db 889 TTCGGTTGCGTTCTCATGGCCATCCCGTCTGTTCTCATCGGTGCAATAGGAACATCCACT 948

Qy 895 GACTGGAACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATG 954 

M I I I I M M I M I I I I I I I I I I I I III I I I M I I I I I I I 

Db 949 GACTGGAATCAGACTTCCTATGGCTTGCCAGGCCCTATAGGCAAAAATGAGACTGATATG 1008

Qy 955 ATTTTACCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGT 1014 

I I I I I II II II I I I I I I MM II M M II I II II M I II I II I II 

Db 1009 ATTTTGCCGATCGTGCTGCAGCATCTGTGTCCACCCTACATTTCCTTTTTTGGTCTTGGC 1068 

Qy 1015 GCAGTTTCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATG 1074 

II M I II I M II I II I I II II I I II II II I I M I I II I II II II I M M III 

Db 1069 GCTGTCTCTGCTGCTGTGATGTCATCGGCTGATTCTTCTATCTTATCAGCAAGTTCTATG 1128 

Qy 1075 TTTGCACGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTT 1134 

II II I Mill II Mill III I II II II I M I II II I I II I I M II M II 
Db 1129 TTTGCTCGGAATATTTACCATCTTGCTTTCAGACAAGAGGCTTCAGACAAAGAAATAGTG 1188 

Qy 1135 TGGGTTATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTG 1194 

Mill II I I I II I I II I III I I II II II I II M II II I Mill I II II 

Db 1189 TGGGTAATGCGAATCACCATATTTCTATTTGGAGGAGCTGCAACATCTATGGCATTGCTT 1248

Qy 1195 ACGAAAACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTC 1254 

I Ml I II I II M I I M M II II II II II II I I II II I M II II 
Db 1249 GCTCAATCAATCTATGGCCTCTGGTATCTGAGCTCAGATCTTGTCTACGTCATTATCTTT 1308 

Qy 1255 CCCCAGCTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGT 1314 

I I I I I II II I I I M II I M II II I I II I I M II I I I Mill 

Db 1309 CCTCAATTAATATCAGTGCTCTTCGTCAAGGGAACAAACACATATGGGTCTATTGCTGGA 1368 

Qy 1315 TATGTTTCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCC 1374 

II I I II M I M II I M I M I M II II M II I II I I M II 

Db 1369 TATATCATTGGCTTTTTGCTTCGGATTAGTGGTGGTGAACCATATTTACATATGCAGCCA 1428 
Qy 1375 TTGATCTTCTACCCTGGCT------------ATTACCCTGATGATAATGGTATATATAAT 1422

II II I II II II I I I II I I M Mill Mill I 

Db 1429 TTTATTTATTACCCTGGATGCTATTTAGATCATTCCTTTGGAGATGATCCTGTTTATGTT 1488

Qy 1423 CAGAAATTTCCATTTAAAACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATC 1482 

II M II M II II I I II II I M M I I M II II I II I M I I I 

Db 1489 CAGAGATTTCCCTTTAAAACCATGGCAATGTTATTCTCCTTCTTGGGCAACACTGGTGTA 1548 



Qy 1483 TCCTATCTAGCCAAGTATCTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTT 1542 

II I I II I I M I I I I II II I I I I I II II I I I I M I I I II I II II I II 
Db 1549 TCATATCTTGTCAAGTACCTGTTCGTAAGTGGAATATTGCCACCAAAATTAGACTTCCTT 1608 

Qy 1543 GATGCTGTTGTTGCAAGACACAGTGAAGAAAACATGGATAAGACAATTCTTGTCAAAAAT 1602 

II II I II II I II I II II I II II I II II I I I I II I I III I 

Db 1609 GACAGCGTTGTATCAAAACACAGTAAGGAAATCATGGACAAAACATTCTTGATGAATCAG 1668 

Qy 1603 GAAAATATTAAATTAGATGAACTTGCACTTGTGAAGCCACGACAGAGCATGACCCTCAGC 1662 

MINIMI II II I I I I II I I I II I II I I I I III 

Db 1669 GACAATATTACTTTGTCAGAGCTGGTGCATGTTAATCCAATACACAGTGCTTCAGTTAGT 1728 

Qy 1663 TCAACTTTCACCAATAAAGAGGCCTTCCTTGATGTTGATTCCAGTCCAGAAGGGTCTGGG 1722 

I I I M I II II II I I I II I I II I I II I I I I I I II III I 

Db 1729 GCTGCTTTGACCAATAAGGAAGCATTTGAAGACATTGAGCCAAATCCTGAACTTTCTAAG 1788 

Qy 1723 ACTGAAGATAATTTA 1737

II M M I I 

Db 1789 TCAGGCAATGATTGA 1803 
ALIGNMENTS



RESULT 1 
AAF81712 

ID AAF81712 standard; cDNA; 1743 BP. 
XX 

AC AAF81712; 
XX 

DT 01-JUN-2001 (first entry)
XX 

DE Human high affinity choline transporter protein encoding cDNA. 

XX 

KW High affinity choline transporter; cho-1; Alzheimer's disease; diagnosis; 

KW ss.

XX 

OS Homo sapiens.



XX 

FH Key Location/Qualifiers 

FT CDS 1..1743

FT /*tag= a 

FT /product= "high affinity choline transporter" 
XX 

PN WO200116315-A1. 
XX 

PD 08-MAR-2001. 
XX 

PF 18-AUG-2000; 2000WO-JP005545.
XX 

PR 27-AUG-1999; 99JP-00240642.

PR 27-DEC-1999; 99JP-00368991.
XX 

PA (NISC-) JAPAN SCI & TECHNOLOGY CORP. 
XX 

PI Haga T, Okuda T; 
XX 

DR WPI; 2001-226688/23. 

DR P-PSDB; AAB74665. 
XX 

PT New rat and human spinal cord high affinity choline transporters, useful

PT in diagnosis of Alzheimer's disease and screening promoters as drugs for 

PT treating Alzheimer's disease. 
XX 

PS Claim 9; Page 71-75; 90pp; Japanese. 
XX 

CC The present sequence encodes a human (Homo sapiens) high affinity choline 

CC transporter protein designated cho-1. The cho-1 protein has nootropic and 

CC neuroprotective activities. The cho-1 polynucleotide and protein can be 

CC used for the diagnosis of diseases related to the expression of cho-1 by 

CC comparing the cho-1 polynucleotide sequence in a sample to that of a 

CC control. Drug compositions containing the cho-1 protein or expression 

CC promoters or inhibitors of cho-1 are useful for treating disorders 

CC characterised by abnormal levels of cho-1, such as Alzheimer's disease 
XX 

SQ Sequence 1743 BP; 412 A; 393 C; 406 G; 532 T; 0 U; 0 Other; 

Query Match 100.0%; Score 1743; DB 4; Length 1743; 
Best Local Similarity 100.0%; Pred. No. 0; 

Matches 1743; Conservative 0; Mismatches 0; Indels 0; Gaps 0 

Qy 1 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60
     ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60

Qy 61 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 120
      ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 61 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 120

Qy 121 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 180
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 180

Qy 181 ACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTAT 240
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 ACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTAT 240

Qy 241 GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 300
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 300

Qy 301 TTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAA 360
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 TTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAA 360

Qy 361 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTC 420
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTC 420

Qy 421 TGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGAT 480
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 421 TGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGAT 480

Qy 481 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG 540
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 481 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG 540

Qy 541 CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGG 600
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 541 CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGG 600

Qy 601 ATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTG 660
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 601 ATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTG 660

Qy 661 CATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGG 720
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 661 CATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGG 720

Qy 721 CTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGG 780
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 721 CTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGG 780

Qy 781 GTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGG 840
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 781 GTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGG 840

Qy 841 TGCCTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGG 900
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 841 TGCCTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGG 900

Qy 901 AACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTA 960
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 901 AACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTA 960

Qy 961 CCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTT 1020
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 961 CCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTT 1020

Qy 1021 TCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCA 1080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1021 TCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCA 1080

Qy 1081 CGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTT 1140
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1081 CGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTT 1140

Qy 1141 ATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAA 1200
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1141 ATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAA 1200

Qy 1201 ACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAG 1260
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1201 ACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAG 1260

Qy 1261 CTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTT 1320
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1261 CTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTT 1320

Qy 1321 TCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATC 1380
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1321 TCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATC 1380

Qy 1381 TTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAATTTCCATTTAAA 1440
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1381 TTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAATTTCCATTTAAA 1440

Qy 1441 ACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTAT 1500
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1441 ACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTAT 1500

Qy 1501 CTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGA 1560
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1501 CTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGA 1560

Qy 1561 CACAGTGAAGAAAACATGGATAAGACAATTCTTGTCAAAAATGAAAATATTAAATTAGAT 1620
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1561 CACAGTGAAGAAAACATGGATAAGACAATTCTTGTCAAAAATGAAAATATTAAATTAGAT 1620

Qy 1621 GAACTTGCACTTGTGAAGCCACGACAGAGCATGACCCTCAGCTCAACTTTCACCAATAAA 1680
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1621 GAACTTGCACTTGTGAAGCCACGACAGAGCATGACCCTCAGCTCAACTTTCACCAATAAA 1680

Qy 1681 GAGGCCTTCCTTGATGTTGATTCCAGTCCAGAAGGGTCTGGGACTGAAGATAATTTACAG 1740
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1681 GAGGCCTTCCTTGATGTTGATTCCAGTCCAGAAGGGTCTGGGACTGAAGATAATTTACAG 1740

Qy 1741 TGA 1743
        |||
Db 1741 TGA 1743



RESULT 2 
AAH49207 

ID AAH49207 standard; cDNA; 1743 BP. 
XX 

AC AAH49207;

XX 

DT 26-NOV-2001 (first entry) 



XX 

DE Human CHOT encoding cDNA. 
XX 

KW CHOT; human; choline transporter; chromosome 2q11-13; nootropic;

KW neuroprotective; gene therapy; antisense therapy; degenerative disease; 

KW cognitive disorder; Alzheimer's disease; ss.

XX 

OS Homo sapiens.
XX 

PN DE10009055-A1. 
XX 

PD 30-AUG-2001. 
XX 

PF 28-FEB-2000; 2000DE-01009055.
XX 

PR 28-FEB-2000; 2000DE-01009055.
XX 

PA (BRUE/) BRUESS M. 

PA (BOEN/) BOENISCH H. 
XX 

PI Bruess M, Boenisch H;
XX 

DR WPI; 2001-590709/67. 

DR P-PSDB; AAB86837. 
XX 

PT A new gene encoding human choline transporter, designated hCHOT is 

PT located on chromosome 2q11-13 and is useful to treat degenerative

PT disorders such as Alzheimer's disease.

XX 

PS Disclosure; Page 11; 12pp; German. 
XX 

CC This invention describes a novel gene encoding human choline transporter, 

CC designated hCHOT which is located on chromosome 2q11-13. The products of

CC the invention have nootropic and neuroprotective activity and can be used 

CC for gene or antisense therapy. (I) is used to treat degenerative disease, 

CC particularly cognitive disorders such as Alzheimer's disease. Sense and 

CC antisense oligonucleotides derived from the gene may be used in 

CC diagnostics and other techniques. This sequence encodes the human CHOT 

CC protein described in the invention 
XX 

SQ Sequence 1743 BP; 412 A; 393 C; 406 G; 532 T; 0 U; 0 Other; 

Query Match 100.0%; Score 1743; DB 5; Length 1743; 
Best Local Similarity 100.0%; Pred. No. 0; 

Matches 1743; Conservative 0; Mismatches 0; Indels 0; Gaps 0 

Qy 1 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60 




Db 



1 



ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60 



Qy 



61 



GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 120 




Db 



61 



GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 120 



Qy 



121 



GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 180 




Db 



121 



GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 180 



Qy 181 ACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTAT 240 

I I I I I I I I I I M I I I I I I I I I I I M I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 8 1 ACCT GGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTAT 240 

Qy 241 GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 300 

I I I I I I I I I I I I I I I I I I I I I M I I I I M I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 300 

Qy 301 TTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAA 360 

I I I I I I I I I I I I I I I I M I I I M I I I I I I I I I I M I I I I I I I I I I I I I I I I M I I I I I I I 

Db 301 TTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAA 360 

Qy 361 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTC 420 

I I I I I I I I I I I M I I I I I M I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTC 420 

Qy 421 TGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGAT 480 

M I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 TGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGAT 4 80 

Qy 481 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG 54 0 

I I I I I I I I M I I I I I I I I I I I I I M I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 481 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG 540 

Qy 541 CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGG 600 

I I I I I I I I I I I M I I I I I I I I I I I I M I I I I I I I I I I M I I I M I I I I I I I I M I I I I I I 

Db 541 CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGG 600 

Qy 601 ATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTG 660 

I M I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I 
Db 601 ATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTG 660 

Qy 661 CATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGG 720 

M I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I 

Db 661 CATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGG 720 

Qy 721 CTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGG 78 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I M I I I I I 
Db 721 CTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGG 780 

Qy 781 GTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGG 840 

I I M I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 7 81 GTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGG 84 0 

Qy 841 TGCCTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGG 900 

I I I I I I I I I M M I I I I I I I I M I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 841 TGCCTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGG 900 

Qy 901 AACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTA 960 

I I I I I I M I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I M I I I I I I I I I I I I I 

Db 901 AACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTA 960 

Qy 961 CCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTT 1020 

I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I M I I I I M I I I I I I I I I I I I I I I I I I I 
Db 961 CCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTT 1020 



Qy 1021 TCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCA 1080 

I I I I I M I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I M I I M I I I I I I I I 

Db 1021 TCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCA 1080 

Qy 1081 CGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTT 1140 

I I I I I I I I I I I I I I I I I I M I I I I I M I I M I I I I M I I I I I I I I I I I I M I I I I M I I I 

Db 1081 CGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTT 1140 

Qy 1141 ATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAA 1200 

I I I I I I I I I M I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1141 ATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAA 1200 

Qy 1201 ACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAG 1260 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I M I I I I I I I I I I I I I I I 
Db 1201 ACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAG 1260 

Qy 1261 CTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTT 1320 

I I I I I I I I I I I I I I I I I I I I I I M I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1261 CTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTT 1320 

Qy 1321 TCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATC 1380 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1321 TCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATC 1380 

Qy 1381 TTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAATTTCCATTTAAA 1440 

I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I M I I M I I I I I I I I I M M I I I I I 
Db 1381 TTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAATTTCCATTTAAA 1440 

Qy 1441 ACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTAT 1500 

I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1441 ACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTAT 1500 

Qy 1501 CTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGA 1560 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I M I I M I I I I I I I I I M I I I I I I I 
Db 1501 CTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGA 1560 

Qy 1561 CACAGTGAAGAAAACATGGATAAGACAATTCTTGTCAAAAATGAAAATATTAAATTAGAT 1620 

I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I 
Db 1561 CACAGTGAAGAAAACATGGATAAGACAATTCTTGTCAAAAATGAAAATATTAAATTAGAT 1620 

Qy 1621 GAACTTGCACTTGTGAAGCCACGACAGAGCATGACCCTCAGCTCAACTTTCACCAATAAA 1680 

I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I II I I M I I I I I I I M I I I I I 

Db 1621 GAACTTGCACTTGTGAAGCCACGACAGAGCATGACCCTCAGCTCAACTTTCACCAATAAA 1680 

Qy 1681 GAGGCCTTCCTTGATGTTGATTCCAGTCCAGAAGGGTCTGGGACTGAAGATAATTTACAG 1740 

I II I II I II I II II II I I II I II I II I II I I II II I II I II II 11 I M I I I I I I I I I I I I 
Db 1681 GAGGCCTTCCTTGATGTTGATTCCAGTCCAGAAGGGTCTGGGACTGAAGATAATTTACAG 1740 

Qy 1741 TGA 1743 

I I I 

Db 1741 TGA 1743 



RESULT 3 
ADD50638 

ID ADD50638 standard; cDNA; 1743 BP. 
XX 



AC ADD50638; 

XX 

DT 15-JAN-2004 (first entry) 
XX 

DE cDNA encoding human high-affinity choline transporter (hCHT). 
XX 

KW Human; high-affinity choline transporter; hCHT; chromosome 2q12; 

KW cholinergic function; Parkinson's disease; Huntington's disease; 

KW Alzheimer's disease; schizophrenia; dysautonomia; myasthenia gravis; 

KW brain; cholinergic signalling; antiparkinsonian; anticonvulsant; 

KW nootropic; neuroprotective; neuroleptic; gene; ss. 

XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 

FT CDS 1..1743 

FT /*tag= a 

FT /product= "hCHT" 

XX 

PN US2003114399-A1. 
XX 

PD 19-JUN-2003. 
XX 

PF 23-JUL-2001; 2001US-00911077. 
XX 

PR 23-JUL-2001; 2001US-00911077. 
XX 

PA (BLAK/) BLAKELY R D. 

PA (APPA/) APPARSUNDARAM S. 

PA (FERG/) FERGUSON S. 

XX 

PI Blakely RD, Apparsundaram S, Ferguson S; 
XX 

DR WPI; 2003-810914/76. 

DR P-PSDB; ADD50639. 
XX 

PT Novel isolated polynucleotide encoding human or mouse high affinity 

PT choline transporter polypeptide, useful in gene therapy to increase 

PT cholinergic function in a cell of a patient suffering from Alzheimer's 

PT disease. 
XX 

PS Claim 2; SEQ ID NO 1; 74pp; English. 
XX 

CC The present invention relates to the isolation of polynucleotide 

CC sequences encoding human and mouse high-affinity choline transporter 

CC (hCHT and mCHT respectively), and the proteins they encode. The gene 

CC encoding hCHT is located on chromosome 2q12. The polynucleotide sequence 

CC encoding hCHT, is useful for expressing hCHT recombinantly. The hCHT 

CC polynucleotide sequence when delivered to a cell, increases cholinergic 

CC function in the cell that is in a patient having Parkinson's disease, 

CC Huntington's disease, Alzheimer's disease, schizophrenia, dysautonomia or 

CC myasthenia gravis. The hCHT antibody is useful for controlling 

CC transporter CHT proteins to the brain, and for treating the above 

CC mentioned diseases. The antibody is also useful for diagnosing the above 

CC mentioned disorders and to detect the influence of cholinergic 

CC signalling. The present sequence encodes hCHT. Note: The sequence data 

CC for this patent was obtained in electronic format directly from the USPTO 



CC web site at seqdata.uspto.gov. 
XX 

SQ Sequence 1743 BP; 412 A; 393 C; 406 G; 532 T; 0 U; 0 Other; 



Query Match 100.0%; Score 1743; DB 9; Length 1743; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 1743; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60 

I I I I I I I I I I I M I I I I I I I I I I M I I I I I I I I I M I I I I I I I I I I I I I M I I I I I I I t I 
Db 1 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60 

Qy 61 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 120 

Qy 121 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 180 

Qy 181 ACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTAT 240 

I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 ACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTAT 240 

Qy 241 GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 300 

Qy 301 TTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAA 360 

I I M M I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 TTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAA 360 

Qy 361 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTC 420 

I I I I I I I I I I I I I I I I I I I I I I I I I M I M I I I I I I I I I I I I I I I M I I ! I I I I I I I I I I 
Db 361 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTC 420 

Qy 421 TGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGAT 480 

I I I I I I I I I M I I I I I I I I M I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I 
Db 421 TGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGAT 480 

Qy 481 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG 540 

I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M M I I I I I I I 

Db 481 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG 540 

Qy 541 CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGG 600 

M I I I I I I I I M i M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I 
Db 541 CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGG 600 

Qy 601 ATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTG 660 

I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I M 
Db 601 ATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTG 660 

Qy 661 CATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGG 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I M I I I I I I I 
Db 661 CATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGG 720 

Qy 721 CTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGG 780 



I I I I I I I I I I I I I I I I I I I I i I I I I M I I I I I I I I I I I I I I I I I I I M I I I M I I I I I I I 

Db 721 CTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGG 780 

Qy 781 GTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGG 840 

M I I I I I I I I I I M I I I I I I I I I I I I I M I I I M I I I M I I I M I I I I I I I I I I I I I I I I 

Db 781 GTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGG 840 

Qy 841 TGCCTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGG 900 

I I I M I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I 
Db 841 TGCCTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGG 900 

Qy 901 AACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTA 960 

I I I I M I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 901 AACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTA 960 

Qy 961 CCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTT 1020 

I I I I I I I I I I I I I I I I I I I I I I I I I M I M I I I I I I I I I I I I I I I I I I I I I I I I I I M I I 
Db 961 CCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTT 1020 

Qy 1021 TCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCA 1080 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I M I I I I I 

Db 1021 TCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCA 1080 

Qy 1081 CGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTT 1140 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I 
Db 1081 CGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTT 1140 

Qy 1141 ATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAA 1200 

I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I M 

Db 1141 ATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAA 1200 

Qy 1201 ACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAG 1260 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1201 ACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAG 1260 

Qy 1261 CTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTT 1320 

I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1261 CTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTT 1320 

Qy 1321 TCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATC 1380 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1321 TCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATC 1380 

Qy 1381 TTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAATTTCCATTTAAA 1440 

I I I M I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 
Db 1381 TTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAATTTCCATTTAAA 1440 

Qy 1441 ACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTAT 1500 

I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I M 
Db 1441 ACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTAT 1500 

Qy 1501 CTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGA 1560 

I I I I I I I M I I I I I I I M I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1501 CTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGA 1560 



Qy 1561 CACAGTGAAGAAAACATGGATAAGACAATTCTTGTCAAAAATGAAAATATTAAATTAGAT 1620 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1561 CACAGTGAAGAAAACATGGATAAGACAATTCTTGTCAAAAATGAAAATATTAAATTAGAT 1620 

Qy 1621 GAACTTGCACTTGTGAAGCCACGACAGAGCATGACCCTCAGCTCAACTTTCACCAATAAA 1680 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1621 GAACTTGCACTTGTGAAGCCACGACAGAGCATGACCCTCAGCTCAACTTTCACCAATAAA 1680 

Qy 1681 GAGGCCTTCCTTGATGTTGATTCCAGTCCAGAAGGGTCTGGGACTGAAGATAATTTACAG 1740 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1681 GAGGCCTTCCTTGATGTTGATTCCAGTCCAGAAGGGTCTGGGACTGAAGATAATTTACAG 1740 

Qy 1741 TGA 1743 

|||
Db 1741 TGA 1743 



RESULT 4 
ADD50646 

ID ADD50646 standard; DNA; 1813 BP. 
XX 

AC ADD50646; 
XX 

DT 15-JAN-2004 (first entry) 
XX 

DE High-affinity choline transporter (CHT) associated DNA sequence #2. 
XX 

KW High-affinity choline transporter; CHT; cholinergic function; 

KW Parkinson's disease; Huntington's disease; Alzheimer's disease; 

KW schizophrenia; dysautonomia; myasthenia gravis; brain; 

KW cholinergic signalling; antiparkinsonian; anticonvulsant; nootropic; 

KW neuroprotective; neuroleptic; ds. 

XX 

OS Unidentified. 
XX 

PN US2003114399-A1. 
XX 

PD 19-JUN-2003. 
XX 

PF 23-JUL-2001; 2001US-00911077. 
XX 

PR 23-JUL-2001; 2001US-00911077. 
XX 

PA (BLAK/) BLAKELY R D. 

PA (APPA/) APPARSUNDARAM S. 

PA (FERG/) FERGUSON S. 

XX 

PI Blakely RD, Apparsundaram S, Ferguson S; 
XX 

DR WPI; 2003-810914/76. 
XX 

PT Novel isolated polynucleotide encoding human or mouse high affinity 

PT choline transporter polypeptide, useful in gene therapy to increase 

PT cholinergic function in a cell of a patient suffering from Alzheimer's 

PT disease. 
XX 

PS Disclosure; SEQ ID NO 9; 74pp; English. 
XX 

CC The present invention relates to the isolation of polynucleotide 



CC sequences encoding human and mouse high-affinity choline transporter 

CC (hCHT and mCHT respectively), and the proteins they encode. The gene 

CC encoding hCHT is located on chromosome 2q12. The polynucleotide sequence 

CC encoding hCHT, is useful for expressing hCHT recombinantly. The hCHT 

CC polynucleotide sequence when delivered to a cell, increases cholinergic 

CC function in the cell that is in a patient having Parkinson's disease, 

CC Huntington's disease, Alzheimer's disease, schizophrenia, dysautonomia or 

CC myasthenia gravis. The hCHT antibody is useful for controlling 

CC transporter CHT proteins to the brain, and for treating the above 

CC mentioned diseases. The antibody is also useful for diagnosing the above 

CC mentioned disorders and to detect the influence of cholinergic 

CC signalling. The present DNA sequence of unknown function is provided in 

CC the electronic sequence data but is not mentioned in the printed 

CC specification. Note: The sequence data for this patent was obtained in 

CC electronic format directly from the USPTO web site at seqdata.uspto.gov. 

XX 

SQ Sequence 1813 BP; 440 A; 406 C; 417 G; 550 T; 0 U; 0 Other; 
Query Match 100.0%; Score 1743; DB 9; Length 1813; 
Best Local Similarity 100.0%; Pred. No. 0; 

Matches 1743; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 19 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 78 

Qy 61 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 120 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 79 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 138 

Qy 121 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 180 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 139 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 198 

Qy 181 ACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTAT 240 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 199 ACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTAT 258 

Qy 241 GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 300 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 259 GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 318 

Qy 301 TTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAA 360 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 319 TTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAA 378 

Qy 361 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTC 420 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 379 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTC 438 

Qy 421 TGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGAT 480 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 439 TGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGAT 498 

Qy 481 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG 540 

||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 499 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG 558 

Qy 541 CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGG 600 

M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I 

Db 559 CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGG 618 

Qy 601 ATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTG 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 619 ATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTG 678 

Qy 661 CATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGG 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I M I I I 
Db 679 CATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGG 738 

Qy 721 CTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGG 780 

I I I I I I I I I I I I I I I I M I I I I I M I I I I I I I I I I I I I I I N I I I I I M I I I I I I I I I I I 
Db 739 CTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGG 798 

Qy 781 GTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGG 840 

I I I I I I I I I I M I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I M I I I I I I I I I I 

Db 799 GTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGG 858 

Qy 841 TGCCTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGG 900 

I I I i I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I M I I I I I I I i I I I I I I I I 
Db 859 TGCCTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGG 918 

Qy 901 AACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTA 960 

I I I I I I I I I I I I I I I M I I I I I M I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I M I 
Db 919 AACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTA 978 

Qy 961 CCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTT 1020 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I M 
Db 979 CCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTT 1038 

Qy 1021 TCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCA 1080 

I I I M I M I I M I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I 

Db 1039 TCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCA 1098 

Qy 1081 CGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTT 1140 

I M I I I I I I I M I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1099 CGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTT 1158 

Qy 1141 ATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAA 1200 

I I N I I I I I I I I I I I I I I I I I I I I I I M M I I I I I I I I I I I I I I I I I I I I I I M I I I I I I 
Db 1159 ATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAA 1218 

Qy 1201 ACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAG 1260 

M I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I M I I I I I I I I M I I M I I I I I I I 

Db 1219 ACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAG 1278 

Qy 1261 CTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTT 1320 

I I I I I I I I I I M M I I I I M I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I 

Db 1279 CTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTT 1338 

Qy 1321 TCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATC 1380 

I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I M I I M I I I M I I I I I I I I I I I I I 
Db 1339 TCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATC 1398 



Qy 1381 TTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAATTTCCATTTAAA 1440 

M I I I I I I I I I I I I I M I M I I M M I I I I I I I I I I I I I I I I I M I I M I I I I I I I I I I I 

Db 1399 TTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAATTTCCATTTAAA 1458 

Qy 1441 ACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTAT 1500 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1459 ACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTAT 1518 

Qy 1501 CTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGA 1560 

M I I I I M I I I I I I I I M I I I I I I I I I I I M I I I I I I I I i I I I I I i I I I I I I M I I I I I I 

Db 1519 CTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGA 1578 

Qy 1561 CACAGTGAAGAAAACATGGATAAGACAATTCTTGTCAAAAATGAAAATATTAAATTAGAT 1620 

I I I I I I I I i I I I I I I I M I I I I I I I I M I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I 
Db 1579 CACAGTGAAGAAAACATGGATAAGACAATTCTTGTCAAAAATGAAAATATTAAATTAGAT 1638 

Qy 1621 GAACTTGCACTTGTGAAGCCACGACAGAGCATGACCCTCAGCTCAACTTTCACCAATAAA 1680 

N I I I I I I M M I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I 

Db 1639 GAACTTGCACTTGTGAAGCCACGACAGAGCATGACCCTCAGCTCAACTTTCACCAATAAA 1698 

Qy 1681 GAGGCCTTCCTTGATGTTGATTCCAGTCCAGAAGGGTCTGGGACTGAAGATAATTTACAG 1740 

I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1699 GAGGCCTTCCTTGATGTTGATTCCAGTCCAGAAGGGTCTGGGACTGAAGATAATTTACAG 1758 

Qy 1741 TGA 1743 

III 

Db 1759 TGA 1761 
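
Because this alignment is reported as ungapped (Indels 0; Gaps 0), each query coordinate maps to its database coordinate by a constant offset: Qy 1 pairs with Db 19 and Qy 1743 with Db 1761. The mapping can be sketched in Python as below. The function name and default arguments are hypothetical and only restate the coordinates printed in the alignment above.

```python
def qy_to_db(qy_pos: int, qy_start: int = 1, db_start: int = 19,
             qy_end: int = 1743) -> int:
    """Map a query coordinate to the database coordinate of an ungapped alignment."""
    if not qy_start <= qy_pos <= qy_end:
        raise ValueError(f"query position {qy_pos} is outside {qy_start}..{qy_end}")
    # With no indels, the offset between the two coordinate systems is constant.
    return db_start + (qy_pos - qy_start)


if __name__ == "__main__":
    for pos in (1, 60, 541, 1743):
        print(pos, "->", qy_to_db(pos))
```

The same offset would not hold for an alignment that contains indels, where the mapping has to be tracked column by column.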



RESULT 5 
ABX94338 

ID ABX94338 standard; cDNA; 1743 BP. 
XX 

AC ABX94338; 
XX 

DT 13-JUN-2003 (first entry) 
XX 

DE Human cDNA encoding high affinity choline transporter, HACT. 
XX 

KW Human; ss; gene; HACT; high affinity choline transporter; pain; 

KW neurotransmitter biosynthesis; learning and memory; aging; epilepsy; 

KW neurological disorder; spasticity; myoclonus; muscle spasm; 

KW muscle hyperactivity; stroke; head trauma; neuronal cell death; 

KW multiple sclerosis; spinal chord injury; dystonia; Alzheimer's disease; 

KW Myasthenia Gravis; multi-infarct dementia; AIDS dementia; 

KW Parkinson's disease; Huntington's disease; amyotrophic lateral sclerosis; 

KW ALS; attention deficit disorder; organic brain syndrome; schizophrenia; 

KW nicotine addiction; memory disorder; cognitive disorder. 
XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 

FT CDS 1..1743 

FT /*tag= a 

FT /product= "HACT" 

XX 

PN US6500643-B1. 



XX 

PD 31-DEC-2002. 
XX 

PF 07-SEP-2000; 2000US-00657252. 
XX 

PR 07-SEP-2000; 2000US-00657252. 
XX 

PA (UYFL ) UNIV FLORIDA. 
XX 

PI Wu D, Gu Y, Millard WJ, He Y; 
XX 

DR WPI; 2003-361535/34. 

DR P-PSDB; ABU08979. 

XX 

PT Novel isolated polynucleotide (I) that encodes high affinity choline 

PT transporter protein, useful for preventing, treating or ameliorating 

PT neurological and cognitive disorders such as Alzheimer's or Parkinson's 

PT disease. 
XX 

PS Claim 2; Col 17-21; 20pp; English. 
XX 

CC The invention relates to an isolated polynucleotide which encodes a high 

CC affinity choline transporter (HACT) protein appearing as ABU08979. Also 

CC included are a polynucleotide encoding a fragment consisting of at least 

CC about 50 amino acids of the HACT protein, a vector comprising the 

CC polynucleotide, a composition comprising a vector comprising a 

CC polynucleotide which comprises at least about 12 contiguous nucleic acids 

CC of a polynucleotide appearing as ABX94339 (encoding choline 

CC acetyltransferase), a recombinant host cell which comprises the vector 

CC (used to express the HACT protein or fragment). The polynucleotide is 

CC useful as a probe or primer to detect the presence of HACT polynucleotide 

CC in a sample, such as a biological sample, or for screening for test 

CC agents which bind to the polynucleotide. A pharmaceutical composition 

CC comprising the polynucleotide is useful for preventing, treating or 

CC ameliorating neurological and cognitive disorders e.g. pain, spasticity, 

CC myoclonus, muscle spasm, muscle hyperactivity, epilepsy, stroke, head 

CC trauma, neuronal cell death, multiple sclerosis, spinal chord injury, 

CC dystonia, Alzheimer's disease, myasthenia gravis, multi-infarct 

CC dementia, AIDS dementia, Parkinson's disease, Huntington's disease, 

CC amyotrophic lateral sclerosis (ALS), attention deficit disorder, nicotine 

CC addiction, organic brain syndromes, schizophrenia or memory and cognitive 

CC disorders. HACT is thought to be the rate limiting step in cholinergic 

CC neurotransmitter biosynthesis and regeneration (cholinergic transmissions 

CC are crucial to brain functions such as learning and memory). The present 

CC sequence encodes human HACT. 

XX 

SQ Sequence 1743 BP; 411 A; 395 C; 405 G; 532 T; 0 U; 0 Other; 

Query Match 99.7%; Score 1738.2; DB 8; Length 1743; 

Best Local Similarity 99.8%; Pred. No. 0; 

Matches 1740; Conservative 0; Mismatches 3; Indels 0; Gaps 0; 
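
The summary figures for this result can be cross-checked with simple arithmetic. The Python sketch below rests on assumptions inferred from the numbers printed in this report, not on any documented definition: that Query Match is the score divided by the perfect score (1743), that Best Local Similarity is the number of matches divided by the number of aligned columns, and that the IDENTITY_NUC table scores each match as +1. All function names are hypothetical.

```python
def query_match_pct(score: float, perfect_score: float) -> float:
    """Score as a percentage of the perfect (self-match) score."""
    return 100.0 * score / perfect_score


def local_similarity_pct(matches: int, mismatches: int, indels: int = 0) -> float:
    """Matches as a percentage of all aligned columns."""
    return 100.0 * matches / (matches + mismatches + indels)


def implied_mismatch_penalty(score: float, matches: int, mismatches: int,
                             match_score: float = 1.0) -> float:
    """Back-calculate the per-mismatch penalty, assuming each match scores match_score."""
    return (matches * match_score - score) / mismatches


if __name__ == "__main__":
    # Values taken from the RESULT 5 summary lines above.
    print(round(query_match_pct(1738.2, 1743), 1))
    print(round(local_similarity_pct(1740, 3), 1))
    print(round(implied_mismatch_penalty(1738.2, 1740, 3), 2))
```

Under these assumptions the three mismatches account for the whole difference between 1740 matches and the score of 1738.2, which would imply a penalty of about 0.6 per mismatch.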

Qy 1 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60 

I I N I M M I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60 



Qy 61 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 120 



Db 61 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 120 

Qy 121 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I M I I I I I I I I I I I I I I I I I M 

Db 121 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 180 

Qy 181 ACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTAT 240 

I I M I I I I I I I I I I I I I I M I I M I I M I M I I I I I I I I I I I I M I I I I I I I I I I M I I I 

Db 181 ACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTAT 240 

Qy 241 GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I 
Db 241 GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 300 

Qy 301 TTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAA 360 

I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 TTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAA 360 

Qy 361 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTC 420 

I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I M 
Db 361 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTC 420 

Qy 421 TGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGAT 480 

I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 TGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGAT 480 

Qy 481 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG 540 

I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I M I M I I I I I I I I I I I I I I I M 

Db 481 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG 540 

Qy 541 CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGG 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 541 CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGG 600 

Qy 601 ATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTG 660 

I I M I I I I I I I I M I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 601 ATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTG 660 

Qy 661 CATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGG 720 

I I M I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I 
Db 661 CATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGG 720 

Qy 721 CTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGG 780 

I I I I I I I I M I I I I I I I M M I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I ! I I I M 

Db 721 CTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGG 780 

Qy 781 GTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGG 840 

M M I I M I I I M M M I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 781 GTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGG 840 

Qy 841 TGCCTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGG 900 

I I I I I I I I M I I I I I I I I I I I M I I I I I I I I I I I I I I M I I I I I I I I II I I I I I II I I 
Db 841 TGCCTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCCTCCACAGACTGG 900 

Qy 901 AACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTA 960 

I M I I I I I I I II I I I I II I I I II II I I I I I I I I I M I I I I I I I I I I I M I I I I I I I I I I I 



Db 901 AACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTA 960 

Qy 961 CCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTT 1020 

I I I I I I I I I I I I I I I I I I I I M M I I I I I I I I I I M I I M I I I I I I I I I I I I I I I I I M I 
Db 961 CCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTT 1020 

Qy 1021 TCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCA 1080 

I I M I I I I I I I I I M I I I I I I I I I I I M I M I I I I I I I I I I I M I I I I I I I I I I I I I I I I 

Db 1021 TCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCA 1080 

Qy 1081 CGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTT 1140 

I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I M I I I I I I I I M I I I I I I I I I I I I I I I 
Db 1081 CGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTT 1140 

Qy 1141 ATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAA 1200 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I 

Db 1141 ATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAA 1200 

Qy 1201 ACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAG 1260 

I I I I I I M I I I M I I I I I I I M I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I 
Db 1201 ACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAG 1260 

Qy 1261 CTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTT 1320 

I I I I I I I I I I I I M I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I 
Db 1261 CTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTT 1320 

Qy 1321 TCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATC 1380 

I I I I M I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1321 TCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATC 1380 

Qy 1381 TTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAATTTCCATTTAAA 1440 

I I I I I I I I I I I I I I I I I I I I I I I I I M I ( I I I I I I I I I I I I I I I I M I I I I I I I I I I I M 
Db 1381 TTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAATTTCCATTTAAA 1440 

Qy 1441 ACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTAT 1500 

I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I M I I M I I I I I I I M I I I I M I I I 

Db 1441 ACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTAT 1500 

Qy 1501 CTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGA 1560 

I I I M I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I M I I I I I I I I I I I I I M I I I I 

Db 1501 CTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGA 1560 

Qy 1561 CACAGTGAAGAAAACATGGATAAGACAATTCTTGTCAAAAATGAAAATATTAAATTAGAT 1620 

I I I I I I I I N I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I M I I I I M I I I I I I 
Db 1561 CACAGTGAAGAAAACATGGATAAGACAATTCTTGTCAAAAATGAAAATATTAAATTAGAT 1620 


Qy 1621 GAACTTGCACTTGTGAAGCCACGACAGAGCATGACCCTCAGCTCAACTTTCACCAATAAA 1680 

I I I I I I I I I I I I I i I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I 

Db 1621 GAACTTGCACTTGTGAAGCCACGACAGAGCATGACCCTCAGCTCAACTTTCACCAATAAA 1680 

Qy 1681 GAGGCCTTCCTTGATGTTGATTCCAGTCCAGAAGGGTCTGGGACTGAAGATAATTTACAG 1740 

M I I I M I I I I I I I I I I I I I i I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1681 GAGGCCTTCCTTGATGTTGATTCCAGTCCAGAAGGGTCTGGGACTGAAGATAATTTACAA 1740 

Qy 1741 TGA 1743 

I I I 

Db 1741 TGA 1743 



RESULT 6 
AAF81711 

ID AAF81711 standard; cDNA; 1743 BP. 
XX 

AC AAF81711; 
XX 

DT 01-JUN-2001 (first entry) 
XX 

DE Rat high affinity choline transporter protein encoding cDNA. 
XX 

KW High affinity choline transporter; cho-1; Alzheimer's disease; diagnosis; 
KW ss. 

XX 

OS Rattus norvegicus. 

XX 

FH Key Location/Qualifiers 
FT CDS 1..1743 

FT /*tag= a 

FT /product= "high affinity choline transporter" 

XX 

PN WO200116315-A1. 

XX 

PD 08-MAR-2001. 
XX 

PF 18-AUG-2000; 2000WO-JP005545. 
XX 

PR 27-AUG-1999; 99JP-00240642. 
PR 27-DEC-1999; 99JP-00368991. 
XX 

PA (NISC-) JAPAN SCI & TECHNOLOGY CORP. 
XX 

PI Haga T, Okuda T; 
XX 

DR WPI; 2001-226688/23. 
DR P-PSDB; AAB74664. 
XX 

PT New rat and human spinal cord high affinity choline transporters, useful 
PT in diagnosis of Alzheimer's disease and screening promoters as drugs for 
PT treating Alzheimer's disease. 
XX 

PS Claim 6; Page 64-68; 90pp; Japanese. 
XX 

CC The present sequence encodes a rat (Rattus norvegicus) high affinity 
CC choline transporter protein designated cho-1. The cho-1 protein has 
CC nootropic and neuroprotective activities. The cho-1 polynucleotide and 
CC protein can be used for the diagnosis of diseases related to the 
CC expression of cho-1 by comparing the cho-1 polynucleotide sequence in a 
CC sample to that of a control. Drug compositions containing the cho-1 
CC protein or expression promoters or inhibitors of cho-1 are useful for 
CC treating disorders characterised by abnormal levels of cho-1, such as 
CC Alzheimer's disease 
XX 

SQ Sequence 1743 BP; 414 A; 402 C; 404 G; 523 T; 0 U; 0 Other; 



Query Match 80.0%; Score 1394.2; DB 4; Length 1743; 

Best Local Similarity 87.5%; Pred. No. 0; 

Matches 1525; Conservative 0; Mismatches 218; Indels 0; Gaps 0; 
Qy 1 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60 

I I I I I I I I I I I I I I I I I I I I I I I I I M III I I I M I I II II I I II II III 

Db 1 ATGCCTTTCCATGTAGAAGGACTAGTAGCGATTATCCTGTTCTACCTTCTTATATTTCTG 60 

Qy 61 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 120 

M I I II I I II I II I II I I I II I I I I I I I I I I I I II I I I I I I I I I II I I I II II 
Db 61 GTTGGAATATGGGCTGCATGGAAAACCAAAAACAGCGGTAATGCAGAAGAACGCAGCGAA 120 

Qy 121 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 180 

I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I 

Db 121 GCCATCATAGTTGGGGGCCGAGACATTGGTTTGTTGGTTGGTGGTTTTACCATGACAGCC 180 

Qy 181 ACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTAT 240 

M I I I I I I I I I I I I I I II I I I I I II I I I I M I M I I I I I I I I I I I M I I I I I 
Db 181 ACCTGGGTTGGAGGAGGTTACATCAACGGGACAGCTGAAGCAGTTTATGGGCCAGGTTGT 240 

Qy 241 GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 300 

II I I I I I I I I I I I I I I I M I M M I I I I I I I I I I I I I I I I I M I I I I I I I M I I I I I 

Db 241 GGTCTAGCTTGGGCTCAGGCACCCATTGGATATTCTCTGAGTCTGATTTTAGGTGGCCTG 300 

Qy 301 TTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAA 360 

II I I I I I I I I M I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I M I I I I I II II 
Db 301 TTTTTTGCAAAACCTATGCGTTCCAAGGGATATGTGACTATGTTAGACCCGTTTCAACAG 360 

Qy 361 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTC 420 

I I M I I I M I I M I I I I M II II I II I I II I I I I I I II I II I II I I I I I I I I I 

Db 361 ATCTATGGAAAGCGCATGGGTGGGCTGCTGTTCATCCCTGCACTGATGGGAGAGATGTTC 420 

Qy 421 TGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGAT 480 

I I I I I I I I i I M I I I I I M I I I I II II II I I I I M I I II I I I I I I I I I I I II I I 
Db 421 TGGGCTGCAGCAATTTTCTCTGCATTAGGGGCTACCATCAGCGTAATCATTGATGTGGAT 480 

Qy 481 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG 540 

N MM II I II II II II I I I II II II II II III M II II M M I II II 

Db 481 GTGAACATATCGGTCATTGTCTCCGCACTCATTGCCATTCTTTATACCCTCGTGGGAGGG 540 

Qy 541 CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGG 600 

I M M M II II II II II II II I I II II I II II II I II II M MM Mill 

Db 541 CTCTACTCTGTGGCATATACTGATGTTGTACAGCTATTCTGCATTTTTATAGGATTGTGG 600 

Qy 601 ATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTG 660 

M M I Mill II II I II M I II II II II II I I I II II I II II II II I I M M 

Db 601 ATCAGTGTCCCATTTGCCCTGTCACATCCTGCAGTCACCGACATTGGATTCACTGCTGTG 660 

Qy 661 CATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGG 720 

I M M II II I II I I II II II II II II I II M II I II I M M I M I II I 

Db 661 CATGCTAAATACCAGAGTCCCTGGCTGGGAACCATTGAATCAGTTGAAGTCTACACCTGG 720 

Qy 721 CTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGG 780 

I I M I M II II II M II II II M II II I I M I II M M II Mill II M II 

Db 721 CTTGATAATTTTCTGTTGTTGATGCTGGGTGGAATACCATGGCAAGCCTACTTCCAGAGG 780 

Qy 781 GTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGG 840 

II M M II II II Mill II II II M M I I I I M M II I M M II II II M I III 



Db 781 GTCCTCTCTTCATCGTCAGCGACCTATGCTCAGGTGCTGTCCTTCCTGGCAGCTTTTGGG 840 

Qy 841 TGCCTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGG 900 

I I I I M I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II M I I I I I I I 
Db 841 TGCCTGGTGATGGCTCTACCAGCCATTTGCATTGGGGCCATTGGAGCCTCCACAGACTGG 900 

Qy 901 AACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTA 960 

Mill M I II I I I II II II II II I I I I I I I II I I II M I I I M I I I I I II I 

Db 901 AACCAAACTGCATATGGGTTTCCAGATCCCAAGACCAAGGAGGAAGCAGACATGATTCTC 960 

Qy 961 CCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTT 1020 

M II I II II I II II I I I I I I I I II I I I II I I I I I II I II I I I I I I I M I I III 
Db 961 CCGATTGTTCTACAGTACCTCTGCCCTGTGTACATTTCCTTCTTTGGGCTTGGTGCTGTT 1020 

Qy 1021 TCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCA 1080 

M I II I I I II I I I I II II II II II II I I I I I I M I I I II I I I I I I II I I II 

Db 1021 TCTGCTGCTGTCATGTCCTCGGCTGACTCATCCATCCTATCAGCAAGTTCCATGTTTGCT 1080 

Qy 1081 CGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTT 1140 

I I I I I I I I I I I I I I I I II I I I I I I I I II I I I I I I II I I I I I I I II I II I I I I I 
Db 1081 CGGAATATCTACCAGCTTTCCTTCAGACAAAATGCATCAGACAAGGAAATTGTGTGGGTC 1140 

Qy 1141 ATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAA 1200 

Ml I II I I I II I I I M I I I M I I I II I I I II I I I I I I II I I I I I I II I I I I I I I I 

Db 1141 ATGAGGATCACTGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTCACGAAG 1200 

Qy 1201 ACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAG 1260 

M I M I I I I I I II I I II M I II I II I I I I I I M II I I I I I I I I I I II II I I III 
Db 1201 ACTGTGTATGGGCTCTGGTACCTGAGCTCTGACCTTGTCTACATCATCATCTTCCCACAG 1260 

Qy 1261 CTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTT 1320 

Mill I I I I I I I M I I I II I I I I I I I I M I I I I II I I I II II I I I I I I II 
Db 1261 CTGCTCTGTGTACTCTTCATCAAAGGAACCAACACTTATGGGGCAGTTGCTGGTTATATT 1320 

Qy 1321 TCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATC 1380 

I I M II I I I II I M I II II I I I I I I I I I I I I I I I I II I I N I II I I III 

Db 1321 TTTGGACTTTTCCTGAGAATTACCGGAGGAGAGCCATATCTATACTTGCAGCCCTTAATC 1380 

Qy 1381 TTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAATTTCCATTTAAA 1440 

M I M I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I II I I I M II I I 

Db 1381 TTCTACCCTGGTTATTACCCTGACAAGAATGGTATATACAATCAGAGGTTCCCATTTAAA 1440 

Qy 1441 ACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTAT 1500 

II M I I I I M I I I I I 1 I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I II I M I I I 

Db 1441 ACTCTCTCCATGGTTACCTCATTCTTTACCAACATTTGTGTTTCCTATCTAGCCAAGTAT 1500 

Qy 1501 CTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGA 1560 

M I M I II I I I I I I I I I I I I I I I II I I I II I I I I II I I I I I I I I I I I I I I I I I I 
Db 1501 CTATTTGAAAGTGGAACCTTGCCTCCAAAATTAGATATATTTGATGCTGTTGTCTCAAGG 1560 

Qy 1561 CACAGTGAAGAAAACATGGATAAGACAATTCTTGTCAAAAATGAAAATATTAAATTAGAT 1620 

M II II I I I I I I I I M I I I I I I I I I I I I I MM M M M I I I M II I I II M 
Db 1561 CACAGTGAAGAGAACATGGACAAGACCATTCTAGTCAGAAATGAAAACATCAAATTAAAT 1620 

Qy 1621 GAACTTGCACTTGTGAAGCCACGACAGAGCATGACCCTCAGCTCAACTTTCACCAATAAA 1680 

M M I I I I I I III I I I I I I I I I I I I I I I II I I I II I II I I II I I I II I I M I II 

Db 1621 GAACTTGCACCTGTAAAGCCTCGACAGAGCCTAACCCTCAGTTCAACTTTCACCAATAAA 1680 



Qy 1681 GAGGCCTTCCTTGATGTTGATTCCAGTCCAGAAGGGTCTGGGACTGAAGATAATTTACAG 1740 

Db 1681 GAGGCTCTCCTTGATGTTGATTCCAGTCCAGAGGGATCTGGGACTGAAGATAACTTACAA 1740 

Qy 1741 TGA 1743 

Db 1741 TGA 1743 



RESULT 7 
ADD50642 

ID ADD50642 standard; cDNA; 4904 BP. 
XX 

AC ADD50642; 
XX 

DT 15-JAN-2004 (first entry) 
XX 

DE cDNA encoding rat high-affinity choline transporter (rCHT). 
XX 

KW Rat; high-affinity choline transporter; rCHT; cholinergic function; 

KW Parkinson's disease; Huntington's disease; Alzheimer's disease; 

KW schizophrenia; dysautonomia; myasthenia gravis; brain; 

KW cholinergic signalling; antiparkinsonian; anticonvulsant; nootropic; 

KW neuroprotective; neuroleptic; gene; ss. 

XX 

OS Rattus sp. 
XX 

FH Key Location/Qualifiers 

FT CDS 224..1966 

FT /*tag= a 

FT /product= "rCHT" 

XX 

PN US2003114399-A1. 
XX 

PD 19-JUN-2003. 
XX 

PF 23-JUL-2001; 2001US-00911077. 
XX 

PR 23-JUL-2001; 2001US-00911077. 
XX 

PA (BLAK/) BLAKELY R D. 

PA (APPA/) APPARSUNDARAM S. 

PA (FERG/) FERGUSON S. 

XX 

PI Blakely RD, Apparsundaram S, Ferguson S; 

XX 

DR WPI; 2003-810914/76. 
DR P-PSDB; ADD50643. 
XX 

PT Novel isolated polynucleotide encoding human or mouse high affinity 
PT choline transporter polypeptide, useful in gene therapy to increase 
PT cholinergic function in a cell of a patient suffering from Alzheimer's 
PT disease. 
XX 

PS Example 1; SEQ ID NO 5; 74pp; English. 
XX 



CC The present invention relates to the isolation of polynucleotide 

CC sequences encoding human and mouse high-affinity choline transporter 

CC (hCHT and mCHT respectively), and the proteins they encode. The gene 

CC encoding hCHT is located on chromosome 2q12. The polynucleotide sequence 

CC encoding hCHT, is useful for expressing hCHT recombinantly. The hCHT 

CC polynucleotide sequence when delivered to a cell, increases cholinergic 

CC function in the cell that is in a patient having Parkinson's disease, 

CC Huntington's disease, Alzheimer's disease, schizophrenia, dysautonomia or 

CC myasthenia gravis. The hCHT antibody is useful for controlling 

CC transporter CHT proteins to the brain, and for treating the above 

CC mentioned diseases. The antibody is also useful for diagnosing the above 

CC mentioned disorders and to detect the influence of cholinergic 

CC signalling. The present sequence encodes rat CHT (rCHT). Note: The 

CC sequence data for this patent was obtained in electronic format directly 

CC from the USPTO web site at seqdata.uspto.gov. 

XX 

SQ Sequence 4904 BP; 1447 A; 991 C; 939 G; 1527 T; 0 U; 0 Other; 



Query Match 80.0%; Score 1394.2; DB 9; Length 4904; 

Best Local Similarity 87.5%; Pred. No. 0; 

Matches 1525; Conservative 0; Mismatches 218; Indels 0; Gaps 0; 
Qy 1 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60 

Ml I I I I I I I I I I I I I I M I I I I I I II III I I II I I I I II II I II II III 

Db 224 ATGCCTTTCCATGTAGAAGGACTAGTAGCGATTATCCTGTTCTACCTTCTTATATTTCTG 283 

Qy 61 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 120 

I I I I I M II I I M II I I I I I I I I I II I I I I I I I II I I I M I I I I I I II I II II 
Db 284 GTTGGAATATGGGCTGCATGGAAAACCAAAAACAGCGGTAATGCAGAAGAACGCAGCGAA 343 

Qy 121 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 180 

M I II I II I I I I I I I II I I I I I I I I I I I II I I I II II I I I I I I II II I I I I II I I 

Db 344 GCCATCATAGTTGGGGGCCGAGACATTGGTTTGTTGGTTGGTGGTTTTACCATGACAGCC 403 

Qy 181 ACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTAT 240 

I M I I I II II I I II 11 II I I I I I II I I I I I I II II I I I I I I I I I II I II II I 

Db 404 ACCTGGGTTGGAGGAGGTTACATCAACGGGACAGCTGAAGCAGTTTATGGGCCAGGTTGT 463 

Qy 241 GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 300 

M I I I II I I I I I I I I M I II I I I I I M I I II I I I II I I II I II I I I I I I I I II I I I I 

Db 464 GGTCTAGCTTGGGCTCAGGCACCCATTGGATATTCTCTGAGTCTGATTTTAGGTGGCCTG 523 

Qy 301 TTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAA 360 

II I I I II I I I I I I I I I I I I I .1 I I I I M I I II 1 I I I I I I I II I I I I I I I I I I I II 

Db 524 TTTTTTGCAAAACCTATGCGTTCCAAGGGATATGTGACTATGTTAGACCCGTTTCAACAG 583 

Qy 361 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTC 420 

I I M I I M I II I I I I I I I I n I I I M II II I I I I I I I I I I M I II I I I I I I I I 

Db 584 ATCTATGGAAAGCGCATGGGTGGGCTGCTGTTCATCCCTGCACTGATGGGAGAGATGTTC 643 

Qy 421 TGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGAT 480 

I I I I I I I I I I I I I I M I I I I I I I II II II II I II I I I I I I I I I I I I I I I II M I 
Db 644 TGGGCTGCAGCAATTTTCTCTGCATTAGGGGCTACCATCAGCGTAATCATTGATGTGGAT 703 

Qy 481 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG 540 

N MM II Mill MM I II I II M II II I III II II II II I II I I II 

Db 704 GTGAACATATCGGTCATTGTCTCCGCACTCATTGCCATTCTTTATACCCTCGTGGGAGGG 763 



Qy 541 CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGG 600 

I I I I I I I I I I I I I II I I II I I M II I I I I I II I I I I I M I I I I I I I I I I I 
Db 764 CTCTACTCTGTGGCATATACTGATGTTGTACAGCTATTCTGCATTTTTATAGGATTGTGG 823 

Qy 601 ATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTG 660 

Mill Ml 11 lllllllllllllllll I II IIIIIIMIIM 
Db 824 ATCAGTGTCCCATTTGCCCTGTCACATCCTGCAGTCACCGACATTGGATTCACTGCTGTG 883 

Qy 661 CATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGG 720 

M I II I I I I I I II I I I I I I I I I I I I M I I II I I I I I I 11 I I I I I I I I I 
Db 884 CATGCTAAATACCAGAGTCCCTGGCTGGGAACCATTGAATCAGTTGAAGTCTACACCTGG 943 

Qy 721 CTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGG 780 

II MIMM INI Mllllll II Ill 
Db 944 CTTGATAATTTTCTGTTGTTGATGCTGGGTGGAATACCATGGCAAGCCTACTTCCAGAGG 1003 

Qy 781 GTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGG 840 

M I M II I I I II I I I I I I I I II I I I I I I I I M I M I i I I I I I I I II I I I II III 
Db 1004 GTCCTCTCTTCATCGTCAGCGACCTATGCTCAGGTGCTGTCCTTCCTGGCAGCTTTTGGG 1063 

Qy 841 TGCCTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGG 900 

M M I M I I I I I I I I Mllllll I I I I I I I M Ill II I I I I I I I M 
Db 1064 TGCCTGGTGATGGCTCTACCAGCCATTTGCATTGGGGCCATTGGAGCCTCCACAGACTGG 1123 

Qy 901 AACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTA 960 

MM II I II M I M II II I II I II I I I I I I I I I 11 II II I II I I II I M I 

Qy 961 CCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTT 1020 

M II II II I I I I I I I M II I II II II I II II I II I II I I I I I Mllllll III 
Db 1184 CCGATTGTTCTACAGTACCTCTGCCCTGTGTACATTTCCTTCTTTGGGCTTGGTGCTGTT 1243 

Qy 1021 TCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCA 1080 

II M II I M I I II I II II II II II I I I I I I I II I I II I I M I I I I I II II I 

Qy 1081 CGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTT 1140 

Mill M II I I I M II II II I I II I I I I I I I I I I II II II I I I I I I II I I I II 
Db 1304 CGGAATATCTACCAGCTTTCCTTCAGACAAAATGCATCAGACAAGGAAATTGTGTGGGTC 1363 

Qy 1141 ATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAA 1200 

Ml I I I II I II I II II I I I I II M I M II I I I I I II II II M I I I I M I I I I I I I 
Db 1364 ATGAGGATCACTGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTCACGAAG 1423 

Qy 1201 ACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAG 1260 

M II I I I I I I I I I I I I I I I I I I II lllllllllll llllll I Mllllll III 
Db 1424 ACTGTGTATGGGCTCTGGTACCTGAGCTCTGACCTTGTCTACATCATCATCTTCCCACAG 1483 

Qy 1261 CTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTT 1320 

Mill lllllllllll I II lllllllllll I I M I I I I II II I I I M I II 
Db 1484 CTGCTCTGTGTACTCTTCATCAAAGGAACCAACACTTATGGGGCAGTTGCTGGTTATATT 1543 

Qy 1321 TCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATC 1380 

I Ml II lllllllllll II I I I I I lllllllllll II I II I I I I II III 
Db 1544 TTTGGACTTTTCCTGAGAATTACCGGAGGAGAGCCATATCTATACTTGCAGCCCTTAATC 1603 



Qy 1381 TTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAATTTCCATTTAAA 1440 

I I I I M I M I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1604 TTCTACCCTGGTTATTACCCTGACAAGAATGGTATATACAATCAGAGGTTCCCATTTAAA 1663 

Qy 1441 ACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTAT 1500 

II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I II I 

Db 1664 ACTCTCTCCATGGTTACCTCATTCTTTACCAACATTTGTGTTTCCTATCTAGCCAAGTAT 1723 

Qy 1501 CTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGA 1560 

I I I I I M M I M I I M I I M I I I II I I I II I II I M I II I I I I I I I II I I I II I 

Db 1724 CTATTTGAAAGTGGAACCTTGCCTCCAAAATTAGATATATTTGATGCTGTTGTCTCAAGG 1783 

Qy 1561 CACAGTGAAGAAAACATGGATAAGACAATTCTTGTCAAAAATGAAAATATTAAATTAGAT 1620 

I I I M I I I I I I I I I I I I I I Mill I I I I I I I I I I I I II I I I I II II I I II M 
Db 1784 CACAGTGAAGAGAACATGGACAAGACCATTCTAGTCAGAAATGAAAACATCAAATTAAAT 1843 

Qy 1621 GAACTTGCACTTGTGAAGCCACGACAGAGCATGACCCTCAGCTCAACTTTCACCAATAAA 1680 

M I I M I I M III I I I I I II I I I I I I I I M M II II I I I I I I I I I I I I I 

Db 1844 GAACTTGCACCTGTAAAGCCTCGACAGAGCCTAACCCTCAGTTCAACTTTCACCAATAAA 1903 

Qy 1681 GAGGCCTTCCTTGATGTTGATTCCAGTCCAGAAGGGTCTGGGACTGAAGATAATTTACAG 1740 

Mill II M II I II II I M II M II I M II II II I M II M M M I I I I Mill 

Db 1904 GAGGCTCTCCTTGATGTTGATTCCAGTCCAGAGGGATCTGGGACTGAAGATAACTTACAA 1963 

Qy 1741 TGA 1743 

I I I 

Db 1964 TGA 1966 



RESULT 8 
ADD50640 



ID ADD50640 standard; cDNA; 1743 BP. 
XX 

AC ADD50640; 
XX 

DT 15-JAN-2004 (first entry) 
XX 

DE cDNA encoding mouse high-affinity choline transporter (mCHT) #1. 
XX 

KW Mouse; high-affinity choline transporter; mCHT; cholinergic function; 

KW Parkinson's disease; Huntington's disease; Alzheimer's disease; 

KW schizophrenia; dysautonomia; myasthenia gravis; brain; 

KW cholinergic signalling; antiparkinsonian; anticonvulsant; nootropic; 

KW neuroprotective; neuroleptic; gene; ss. 

XX 

OS Mus sp. 
XX 

FH Key Location/Qualifiers 

FT CDS 1..1743 

FT /*tag= a 

FT /product= "mCHT #1" 

XX 

PN US2003114399-A1. 
XX 

PD 19-JUN-2003. 
XX 

PF 23-JUL-2001; 2001US-00911077. 
XX 

PR 23-JUL-2001; 2001US-00911077. 
XX 

PA (BLAK/) BLAKELY R D. 
PA (APPA/) APPARSUNDARAM S. 
PA (FERG/) FERGUSON S. 
XX 

PI Blakely RD, Apparsundaram S, Ferguson S; 
XX 

DR WPI; 2003-810914/76. 
DR P-PSDB; ADD50641. 
XX 

PT Novel isolated polynucleotide encoding human or mouse high affinity 
PT choline transporter polypeptide, useful in gene therapy to increase 
PT cholinergic function in a cell of a patient suffering from Alzheimer's 
PT disease. 
XX 

PS Claim 30; SEQ ID NO 3; 74pp; English. 
XX 

CC The present invention relates to the isolation of polynucleotide 
CC sequences encoding human and mouse high-affinity choline transporter 
CC (hCHT and mCHT respectively), and the proteins they encode. The gene 
CC encoding hCHT is located on chromosome 2q12. The polynucleotide sequence 
CC encoding hCHT, is useful for expressing hCHT recombinantly. The hCHT 
CC polynucleotide sequence when delivered to a cell, increases cholinergic 
CC function in the cell that is in a patient having Parkinson's disease, 
CC Huntington's disease, Alzheimer's disease, schizophrenia, dysautonomia or 
CC myasthenia gravis. The hCHT antibody is useful for controlling 
CC transporter CHT proteins to the brain, and for treating the above 
CC mentioned diseases. The antibody is also useful for diagnosing the above 
CC mentioned disorders and to detect the influence of cholinergic 
CC signalling. The present sequence encodes mCHT. Note: The sequence data 
CC for this patent was obtained in electronic format directly from the USPTO 
CC web site at seqdata.uspto.gov. 
XX 

SQ Sequence 1743 BP; 406 A; 409 C; 410 G; 518 T; 0 U; 0 Other; 

Query Match 78.9%; Score 1375; DB 9; Length 1743; 

Best Local Similarity 86.8%; Pred. No. 0; 

Matches 1513; Conservative 0; Mismatches 230; Indels 0; Gaps 0; 

Qy 1 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60 

Ml I I I I I I I I I I I I I I M I I I I I M I I I I III I I I II II II II II II III 
Db 1 ATGCCTTTCCATGTGGAAGGACTGGTAGCTATTATCCTCTTCTACCTCCTTATATTTCTG 60 

Qy 61 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 120 

M I II II I II II I II II I I I I M I I I I I I I I I I I I II I I M I I I M I I I II III 
Db 61 GTTGGAATATGGGCTGCATGGAAAACCAAAAACAGCGGCAACCCAGAAGAGCGCAGTGAA 120 

Qy 121 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 180 

II I I II I I I M II I I I I I II I I I I I II I I II I M I I I M I I I I I I I I I I I II I 

Db 121 GCCATCATAGTCGGGGGCCGTGACATTGGTTTGTTGGTTGGTGGTTTTACCATGACAGCC 180 

Qy 181 ACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTAT 240 

I I I I I I I I I II I I I I I II I I I I I I M I I I M I I I I I II I MM I II II I I I 
Db 181 ACCTGGGTTGGAGGAGGCTACATCAATGGGACAGCAGAAGCAGTGTATGGGCCAGGTTGT 240 

Qy 241 GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 300 

M I I I I I I I M I I I I I I I I I I I I I M I I I I I I I I I I I I I I I M I I I I I I I I IN 

Db 241 GGTCTAGCTTGGGCTCATGCACCCATTGGATATTCTCTGAGTCTAATTTTAGGTGGTCTG 300 

Qy 301 TTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAA 360 

II I I I I I II I I II I I I I I I II I I I I I M II I M I I I M II I I I I I II I II 
Db 301 TTTTTTGCGAAACCTATGCGTTCCAAGGGATATGTGACTATGTTAGACCCATTCAAACAG 360 

Qy 361 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTC 420 

I I I II I I I I I I I I II I I I I M II II II II I I I M I I II II I I I II I II I I II 
Db 361 ATCTATGGAAAGCGCATGGGTGGGCTGCTCTTCATCCCTGCACTGATGGGAGAGATGTTC 420 

Qy 421 TGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGAT 480 

I I I I I I I I I I I M I I I I I I I I I I II II I I I II I I I I I I I I 11 I I I I I I M I I I I I I 

Db 421 TGGGCTGCAGCAATTTTCTCTGCATTAGGGGCCACCATCAGCGTGATCATTGATGTGGAT 480 

Qy 481 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG 540 

II MM II Mill II II M I I I I I II I I II I III II II II MM! Ill 

Db 481 GTGAACATATCGGTCATTGTCTCTGCACTCATTGCCATTCTTTATACCCTAGTGGGTGGG 540 

Qy 541 CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGG 600 

II I I I I M I I II I II I I I I I I I I II I I I II II II I I I I I I I II I I I I I I I I 

Db 541 CTCTACTCTGTGGCATATACTGATGTTGTCCAGCTATTCTGCATTTTTATAGGACTGTGG 600 

Qy 601 ATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTG 660 

I II II II I II Mill II II M II I I II I I II I I II I I II II II I II I I I I I I 
Db 601 ATCAGTGTCCCTTTTGCCCTGTCACATCCTGCAGTCACCGACATCGGATTCACAGCTGTG 660 

Qy 661 CATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGG 720 

I I I I I II II I I I I I II I I I I I I I I I I I MM III I I I I II I I M I Ml 

Db 661 CATGCTAAATACCAGAGTCCCTGGCTGGGAACCATTGAATCAGTTGAAGTCTACACCTGG 720 

Qy 721 CTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGG 780 

M M II I II I I M II I I II I II II I I I II I I II II II II I I II II I I II I I II I II 
Db 721 CTTGATAATTTTCTGTTATTGATGCTGGGTGGAATCCCATGGCAAGCCTACTTCCAGAGG 780 

Qy 781 GTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGG 840 

II I I I I I II I II II I I II II I II II I I II I II I II II II II I I I II I II M I Ml 

Db 781 GTCCTCTCTTCATCCTCAGCCACCTATGCTCAGGTACTGTCCTTCCTGGCAGCTTTTGGG 840 

Qy 841 TGCCTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGG 900 

I I II I II II II II I I II I II II I III II II II II II II II II M II I II 

Db 841 TGCCTGGTGATGGCTCTACCCGCCATATGCATAGGAGCTATTGGAGCTTCCACAGACTGG 900 

Qy 901 AACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTA 960 

ii'l II M M II M III I I II I I II II M II II I II II II II I I Ml MM 

Db 901 AACCAGACTGCCTACGGGTATCCAGATCCCAAGACTAAGGAGGAAGCAGACATGATTCTC 960 

Qy 961 CCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTT 1020 

II II M II M I II II II II M II M II I I II M I II I II II II II I II I III 

Db 961 CCGATCGTTCTGCAGTACCTCTGCCCTGTGTACATCTCCTTCTTTGGGCTTGGTGCTGTT 1020 

Qy 1021 TCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCA 1080 

II M M M II Mill Mill II II II I II I II II II Mill II M I II I 
Db 1021 TCAGCTGCTGTCATGTCCTCAGCTGACTCGTCCATCCTGTCGGCGAGTTCTATGTTTGCT 1080 

Qy 1081 CGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTT 1140 



I I I I I I I I I M I I I I I I I I I M M I M I I M I I I II I I I I I I II I I II I I I I I 

Db 1081 CGGAATATCTACCAGCTTTCCTTCAGACAAAATGCATCAGACAAGGAAATTGTGTGGGTC 1140 

Qy 1141 ATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAA 1200 

III I I I I I I Ml I I II I II I I I I I I I I I I II I II I I I M I I I I I I I I I I I I I I 

Db 1141 ATGAGGATCACTGTGCTTGTGTTCGGAGCATCTGCAACAGCCATGGCTTTGCTGACGAAG 1200 

Qy 1201 ACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAG 1260 

M M M I I I I M I I I I II I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I III 
Db 1201 ACTGTGTATGGGCTCTGGTACCTGAGCTCTGACCTTGTCTACATCATCATCTTCCCACAG 1260 

Qy 1261 CTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTT 1320 

I I I I I I I I I II II I I I I II I I I I I I I I I I I I I I I I I I I II II I I I I I I I I 
Db 1261 CTGCTCTGTGTACTCTTCATCAAAGGAACCAACACTTATGGGGCAGTTGCTGGTTATATT 1320 

Qy 1321 TCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATC 1380 

I III II I I I M I I I I I I M I I I II I I I M M I M I I II I I I I I II M III 

Db 1321 TTTGGACTATTCCTGAGAATTACTGGAGGAGAGCCATATCTATACTTGCAGCCCTTAATC 1380 

Qy 1381 TTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAATTTCCATTTAAA 1440 

I I I I I I I I I I I I I M I I I I I I I M I I I I I I I I I I I I I I I I II I I I I I I I I I 
Db 1381 TTCTACCCTGGTTATTACTCTGACAAGAATGGTATATACAATCAGAGGTTCCCATTTAAA 1440 

Qy 1441 ACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTAT 1500 

M II I I I I I I I II I I M I I I I I M II I I I I I I I I II I M I M II I I I I I I I 

Db 1441 ACTCTCTCCATGGTTACCTCATTCTTTACCAACATTTGTGTTTCTTATCTAGCCAAGTAT 1500 

Qy 1501 CTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGA 1560 

I I I I I I I I I I I I I II I I I I I I I I II I I I II I I I I I I II I I I I I I I I I I I I I I I I I I 
Db 1501 CTATTTGAAAGTGGAACCTTGCCTCCAAAATTAGATGTATTTGATGCTGTTGTCGCAAGG 1560 

Qy 1561 CACAGTGAAGAAAACATGGATAAGACAATTCTTGTCAAAAATGAAAATATTAAATTAGAT 1620 

I I I I I I I M I I M I I I I I I I I I I I Mill II II M II I I I II I II I M M I M 

Db 1561 CACAGTGAAGAGAACATGGACAAGACCATTCTAGTCAGAAATGAAAATATCAAATTAAAT 1620 

Qy 1621 GAACTTGCACTTGTGAAGCCACGACAGAGCATGACCCTCAGCTCAACTTTCACCAATAAA 1680 

I II I II I II I I II II I II II I II II I I II II I II I I I I II II I II II II I I I 

Db 1621 GAACTTGCACCTGTGAAACCTCGGCAGAGCCTAACCCTCAGTTCAACTTTCACCAATAAG 1680 

Qy 1681 GAGGCCTTCCTTGATGTTGATTCCAGTCCAGAAGGGTCTGGGACTGAAGATAATTTACAG 1740 

II I II I II II II II I II M M II II M I II II II I II I II I I M II I II M II II I 

Db 1681 GAGGCCCTCCTTGATGTTGATTCCAGTCCGGAGGGGTCTGGGACTGAAGATAATTTACAA 1740 

Qy 1741 TGA 1743 

I I I 

Db 1741 TGA 1743 



RESULT 9 
ADD50660 

ID ADD50660 standard; cDNA; 1743 BP. 
XX 

AC ADD50660; 
XX 

DT 15-JAN-2004 (first entry) 
XX 

DE cDNA encoding mouse high-affinity choline transporter (mCHT) #2. 



XX 

KW Mouse; high-affinity choline transporter; mCHT; cholinergic function; 

KW Parkinson's disease; Huntington's disease; Alzheimer's disease; 

KW schizophrenia; dysautonomia; myasthenia gravis; brain; 

KW cholinergic signalling; antiparkinsonian; anticonvulsant; nootropic; 

KW neuroprotective; neuroleptic; gene; ss. 

XX 

OS Mus sp. 

XX 

FH Key Location/Qualifiers 

FT CDS 1..1743 

FT /*tag= a 

FT /product= "mCHT #2" 

XX 

PN US2003114399-A1. 

XX 

PD 19-JUN-2003. 
XX 

PF 23-JUL-2001; 2001US-00911077. 
XX 

PR 23-JUL-2001; 2001US-00911077. 
XX 

PA (BLAK/) BLAKELY R D. 

PA (APPA/) APPARSUNDARAM S. 

PA (FERG/) FERGUSON S. 

XX 

PI Blakely RD, Apparsundaram S, Ferguson S; 
XX 

DR WPI; 2003-810914/76. 

DR P-PSDB; ADD50661. 
XX 

PT Novel isolated polynucleotide encoding human or mouse high affinity 

PT choline transporter polypeptide, useful in gene therapy to increase

PT cholinergic function in a cell of a patient suffering from Alzheimer's

PT disease. 
XX 

PS Example 4; SEQ ID NO 23; 74pp; English. 
XX 

CC The present invention relates to the isolation of polynucleotide 

CC sequences encoding human and mouse high-affinity choline transporter 

CC (hCHT and mCHT respectively), and the proteins they encode. The gene 

CC encoding hCHT is located on chromosome 2q12. The polynucleotide sequence

CC encoding hCHT, is useful for expressing hCHT recombinantly. The hCHT

CC polynucleotide sequence when delivered to a cell, increases cholinergic 

CC function in the cell that is in a patient having Parkinson's disease, 

CC Huntington's disease, Alzheimer's disease, schizophrenia, dysautonomia or 

CC myasthenia gravis. The hCHT antibody is useful for controlling 

CC transporter CHT proteins to the brain, and for treating the above 

CC mentioned diseases. The antibody is also useful for diagnosing the above 

CC mentioned disorders and to detect the influence of cholinergic 

CC signalling. The present sequence encodes mCHT. Note: The sequence data 

CC for this patent was obtained in electronic format directly from the USPTO 

CC web site at seqdata.uspto.gov. 

XX 

SQ Sequence 1743 BP; 406 A; 409 C; 410 G; 518 T; 0 U; 0 Other; 



Query Match 78.9%; Score 1375; DB 9; Length 1743;

Best Local Similarity 86.8%; Pred. No. 0;

Matches 1513; Conservative 0; Mismatches 230; Indels 0; Gaps 0;
Qy 1 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60 

III I II I I I I II I II I I I I I I I I I II I I I I Ml I I I I I I I II II II II III 

Db 1 ATGCCTTTCCATGTGGAAGGACTGGTAGCTATTATCCTCTTCTACCTCCTTATATTTCTG 60 

Qy 61 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 120 

I I I I I I II I II M I I I I I I I I I I I I I I I I I I II MM I II I M II M II II III 

Db 61 GTTGGAATATGGGCTGCATGGAAAACCAAAAACAGCGGCAACCCAGAAGAGCGCAGTGAA 120 

Qy 121 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 180 

II I M I II I II M Mill II I II II M I II II II II I II I II II M II II M I 

Db 121 GCCATCATAGTCGGGGGCCGTGACATTGGTTTGTTGGTTGGTGGTTTTACCATGACAGCC 180 

Qy 181 ACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTAT 240 

II II II II II II II M II II II M II II II I I II II M I MM II II I II I 
Db 181 ACCTGGGTTGGAGGAGGCTACATCAATGGGACAGCAGAAGCAGTGTATGGGCCAGGTTGT 240

Qy 241 GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 300 

M II II II II II II II Mill I I M II II M i II I Mill II II II II I II Ml 

Db 241 GGTCTAGCTTGGGCTCATGCACCCATTGGATATTCTCTGAGTCTAATTTTAGGTGGTCTG 300 

Qy 301 TTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAA 360 

II II II I II II II M II II I I II II I II II II I I II II II I II II II I M 
Db 301 TTTTTTGCGAAACCTATGCGTTCCAAGGGATATGTGACTATGTTAGACCCATTCAAACAG 360 

Qy 361 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTC 420 

M I M II II II II II II II M II M II M I I II II II II M II M I MUM 

Db 361 ATCTATGGAAAGCGCATGGGTGGGCTGCTCTTCATCCCTGCACTGATGGGAGAGATGTTC 420 

Qy 421 TGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGAT 480 

II I II I II II II II II M II M I II II II II II I II II II II I II II II II II I II 
Db 421 TGGGCTGCAGCAATTTTCTCTGCATTAGGGGCCACCATCAGCGTGATCATTGATGTGGAT 480

Qy 481 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG 540 

II II II II II II I II II II II II II II II I I III II M M II II I III 

Db 481 GTGAACATATCGGTCATTGTCTCTGCACTCATTGCCATTCTTTATACCCTAGTGGGTGGG 540 

Qy 541 CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGG 600 

II I II II II II II II I M I II II II II II I II II II II I II II II MUM 
Db 541 CTCTACTCTGTGGCATATACTGATGTTGTCCAGCTATTCTGCATTTTTATAGGACTGTGG 600 

Qy 601 ATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTG 660 

II I II Mill II II I II M II II II II II II I I II II II II II II I II II II 

Db 601 ATCAGTGTCCCTTTTGCCCTGTCACATCCTGCAGTCACCGACATCGGATTCACAGCTGTG 660 

Qy 661 CATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGG 720 

M I II M M II II I II I II I II II I II II M I M II I II I II M I I II 

Db 661 CATGCTAAATACCAGAGTCCCTGGCTGGGAACCATTGAATCAGTTGAAGTCTACACCTGG 720 

Qy 721 CTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGG 780 

II I M M M II II II I II II M I I II II II II I II II II II I II I II II I II II II 

Db 721 CTTGATAATTTTCTGTTATTGATGCTGGGTGGAATCCCATGGCAAGCCTACTTCCAGAGG 780

Qy 781 GTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGG 840 

II I II II II I II II II M II M II II M II II I II M II II II II II II II I III 



Db 781 GTCCTCTCTTCATCCTCAGCCACCTATGCTCAGGTACTGTCCTTCCTGGCAGCTTTTGGG 840 

Qy 841 TGCCTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGG 900 

I I I I I I I I I I I I I I I II I I II II III II II I I I I I I I I II I I I I I I I I I 
Db 841 TGCCTGGTGATGGCTCTACCCGCCATATGCATAGGAGCTATTGGAGCTTCCACAGACTGG 900 

Qy 901 AACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTA 960 

I I I I I II I I II M III I II I I I I I I I I I I M M II II I I I I II I I I I II I 

Db 901 AACCAGACTGCCTACGGGTATCCAGATCCCAAGACTAAGGAGGAAGCAGACATGATTCTC 960 

Qy 961 CCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTT 1020 

II II I I I I I I I I I I I I I I I I I I I I I II I I II II I M I I I I I I I I I I I I I III 

Db 961 CCGATCGTTCTGCAGTACCTCTGCCCTGTGTACATCTCCTTCTTTGGGCTTGGTGCTGTT 1020 

Qy 1021 TCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCA 1080 

II I I I II II I I II II I I I I I II II I I I I I I II M II Mill I I I I I I I I 

Db 1021 TCAGCTGCTGTCATGTCCTCAGCTGACTCGTCCATCCTGTCGGCGAGTTCTATGTTTGCT 1080 

Qy 1081 CGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTT 1140 

I I I I I I II I I I I I II I I I I I I I II I II I I I II II II I I I I I I I I I I II I I I I I 

Db 1081 CGGAATATCTACCAGCTTTCCTTCAGACAAAATGCATCAGACAAGGAAATTGTGTGGGTC 1140 

Qy 1141 ATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAA 1200 

III I I I I I I III I I I I I I I I I I I II I II I I I I II I M I I I I I I I I I I I I II I } 

Db 1141 ATGAGGATCACTGTGCTTGTGTTCGGAGCATCTGCAACAGCCATGGCTTTGCTGACGAAG 1200 

Qy 1201 ACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAG 1260

I M I I I I I I I I I I I I I I I II I I I II I I I I I I I M I I I I I I II I I I I I I I I I III 

Db 1201 ACTGTGTATGGGCTCTGGTACCTGAGCTCTGACCTTGTCTACATCATCATCTTCCCACAG 1260 

Qy 1261 CTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTT 1320 

I I II I I I I M I I M I I I II I I I I I I II II I I I I I I I I I II II I I I I I I II 

Db 1261 CTGCTCTGTGTACTCTTCATCAAAGGAACCAACACTTATGGGGCAGTTGCTGGTTATATT 1320 

Qy 1321 TCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATC 1380 

I III II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I 1 I 

Db 1321 TTTGGACTATTCCTGAGAATTACTGGAGGAGAGCCATATCTATACTTGCAGCCCTTAATC 1380

Qy 1381 TTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAATTTCCATTTAAA 1440 

I I I I I I I M II I I I I M II II I I I I I I I I I I I I I I M I I I II I I I II I I I I 

Db 1381 TTCTACCCTGGTTATTACTCTGACAAGAATGGTATATACAATCAGAGGTTCCCATTTAAA 1440 

Qy 1441 ACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTAT 1500 

II II M I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I M I I I I I I I I 

Db 1441 ACTCTCTCCATGGTTACCTCATTCTTTACCAACATTTGTGTTTCTTATCTAGCCAAGTAT 1500 

Qy 1501 CTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGA 1560 

I I I I I I I M I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1501 CTATTTGAAAGTGGAACCTTGCCTCCAAAATTAGATGTATTTGATGCTGTTGTCGCAAGG 1560 

Qy 1561 CACAGTGAAGAAAACATGGATAAGACAATTCTTGTCAAAAATGAAAATATTAAATTAGAT 1620 

I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 

Db 1561 CACAGTGAAGAGAACATGGACAAGACCATTCTAGTCAGAAATGAAAATATCAAATTAAAT 1620 

Qy 1621 GAACTTGCACTTGTGAAGCCACGACAGAGCATGACCCTCAGCTCAACTTTCACCAATAAA 1680 

I I I I I I I I I I I I I I I I M II I I I I I I I I I I I M II I I I I I II I I I I I I I I I I 

Db 1621 GAACTTGCACCTGTGAAACCTCGGCAGAGCCTAACCCTCAGTTCAACTTTCACCAATAAG 1680

Qy 1681 GAGGCCTTCCTTGATGTTGATTCCAGTCCAGAAGGGTCTGGGACTGAAGATAATTTACAG 1740

Db 1681 GAGGCCCTCCTTGATGTTGATTCCAGTCCGGAGGGGTCTGGGACTGAAGATAATTTACAA 1740

Qy 1741 TGA 1743

Db 1741 TGA 1743



RESULT 10 
AAF81713 

ID AAF81713 standard; cDNA; 1743 BP. 
XX 

AC AAF81713; 
XX 

DT 01-JUN-2001 (first entry)
XX 

DE Mouse high affinity choline transporter protein encoding cDNA. 
XX 

KW High affinity choline transporter; cho-1; Alzheimer's disease; diagnosis; 

KW ss.

XX 

OS Mus musculus. 

XX 

FH Key Location/Qualifiers 

FT CDS 1..1743

FT /*tag= a 

FT /product= "high affinity choline transporter" 
XX 

PN WO200116315-A1. 

XX 

PD 08-MAR-2001.
XX

PF 18-AUG-2000; 2000WO-JP005545.
XX

PR 27-AUG-1999; 99JP-00240642.

PR 27-DEC-1999; 99JP-00368991.
XX 

PA (NISC-) JAPAN SCI & TECHNOLOGY CORP. 
XX 

PI Haga T, Okuda T; 
XX 

DR WPI; 2001-226688/23. 

DR P-PSDB; AAB74666. 
XX 

PT New rat and human spinal cord high affinity choline transporters, useful 

PT in diagnosis of Alzheimer's disease and screening promoters as drugs for 

PT treating Alzheimer's disease. 
XX 

PS Claim 12; Page 78-82; 90pp; Japanese. 
XX 

CC The present sequence encodes a mouse (Mus musculus) high affinity choline 

CC transporter protein designated cho-1. The cho-1 protein has nootropic and 

CC neuroprotective activities. The cho-1 polynucleotide and protein can be 

CC used for the diagnosis of diseases related to the expression of cho-1 by 

CC comparing the cho-1 polynucleotide sequence in a sample to that of a 



CC control. Drug compositions containing the cho-1 protein or expression 

CC promoters or inhibitors of cho-1 are useful for treating disorders 

CC characterised by abnormal levels of cho-1, such as Alzheimer's disease 
XX 

SQ Sequence 1743 BP; 407 A; 410 C; 409 G; 517 T; 0 U; 0 Other;



Query Match 78.8%; Score 1373.4; DB 4; Length 1743; 

Best Local Similarity 86.7%; Pred. No. 0; 

Matches 1512; Conservative 0; Mismatches 231; Indels 0; Gaps 0;

Qy 1 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60 

III II II I II II I I I I I I I I I I I I I I II III I M I I I I I I II II II III 
Db 1 ATGTCTTTCCACGTAGAAGGACTGGTAGCTATTATCCTCTTCTACCTCCTTATATTTCTG 60 

Qy 61 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 120

II I I I I I I I I I I I I I I I I I I I II I I I II I M II I I I I I I I I I I I I II III III 

Db 61 GTTGGAATATGGGCTGCATGGAAAACCAAAAACAGCGGCAACCCAGAAGAGCACAGTGAA 120

Qy 121 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 180 

I I I I I I I I I I I II I I I I I II I I I I I I I I M I II I I I I I I I I I I I I I I I I I M I 
Db 121 GCCATCATAGTCGGGGGCCGTGACATTGGTTTGTTGGTTGGTGGTTTTACCATGACAGCC 180 

Qy 181 ACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTAT 240

I I M M I I I I I I I I I I II I I I I I I I I I I I I I II I I I II I I I I I II I I I I I I 

Db 181 ACCTGGGTTGGAGGAGGCTACATCAATGGGACAGCAGAAGCAGTGTATGGGCCAGGTTGT 240

Qy 241 GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 300 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I Mill I M M I I I I I I III 

Db 241 GGTCTAGCTTGGGCTCAGGCACCCATTGGATATTCTCTGAGTCTAATTTTAGGTGGTCTG 300 

Qy 301 TTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAA 360 

II I I I I I I I I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I Mill II 

Db 301 TTTTTTGCGAAACCTATGCGTTCCAAGGGATATGTGACTATGTTAGACCCATTTCAACAG 360 

Qy 361 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTC 420 

II II II I II II II II I II I II M M II II I M II M I M I I II II I MUM 
Db 361 ATCTATGGAAAGCGCATGGGTGGGCTGCTCTTCATCCCTGCACTGATGGGAGAGATGTTC 420 

Qy 421 TGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGAT 480 

I II I II I M I M II II II II I II II II I II I II I II II I M II I I II I I II I M II 

Db 421 TGGGCTGCAGCAATTTTCTCTGCATTAGGGGCCACCATCAGCGTGATCATTGATGTGGAT 480

Qy 481 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG 540 

II II II II Mill II I I II M M II II II II III II II II Mill III 
Db 481 GTGAACATATCGGTCATTGTCTCTGCACTCATTGCCATTCTTTATACCCTAGTGGGTGGG 540 

Qy 541 CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGG 600 

II I I I II M M I I II II I II I II II I I I II II I II I II I II I I I I M I II I 

Db 541 CTCTACTCTGTGGCATATACTGATGTTGTCCAGCTATTCTGCATTTTTATAGGACTGTGG 600 

Qy 601 ATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTG 660 

I II I I Mill Mill II II II II II II I II II I II II I M I Mill MUM 
Db 601 ATCAGTGTCCCTTTTGCCCTGTCACATCCTGCAGTCACCGACATCGGATTCACAGCTGTG 660 

Qy 661 CATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGG 720 

M I M II II II M I M 11 II II II II I I II I II I II II II II M I I II 

Db 661 CATGCTAAATACCAGAGTCCCTGGCTGGGAACCATTGAATCAGTTGAAGTCTACACCTGG 720



Qy 721 CTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGG 780 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I Mill I I I I I I 

Db 721 CTTGATAATTTTCTGTTATTGATGCTGGGTGGAATCCCATGGCAAGCCTACTTCCAGAGG 780

Qy 781 GTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGG 840

M I II II I II I I M I I I M M M I II I I I I II I I I I I M I I I I I I II I I I I I III 

Db 781 GTCCTCTCTTCATCCTCAGCCACCTATGCTCAGGTACTGTCCTTCCTGGCAGCTTTTGGG 840 

Qy 841 TGCCTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGG 900

I I I I I M I I M M I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I 

Db 841 TGCCTGGTGATGGCTCTACCCGCCATATGCATAGGAGCTATTGGAGCTTCCACAGACTGG 900 

Qy 901 AACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTA 960 

I II I II I M I I II III I I I I I I I I I I I I I M I I II II I I I I I I I I I I I I I 

Db 901 AACCAGACTGCCTACGGGTATCCAGATCCCAAGACTAAGGAGGAAGCAGACATGATTCTC 960 

Qy 961 CCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTT 1020 

II II I II I I I I I I M I I I I I I I I I I I I I I II II M I I I I I I I N I I I I I IN 

Db 961 CCGATCGTTCTGCAGTACCTCTGCCCTGTGTACATCTCCTTCTTTGGGCTTGGTGCTGTT 1020 

Qy 1021 TCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCA 1080 

II I I M I I I I I I I I I I I I I I II M I I I I I I MM M Mill I II I I M I 

Db 1021 TCAGCTGCTGTCATGTCCTCAGCTGACTCGTCCATCCTGTCGGCGAGTTCTATGTTTGCT 1080 

Qy 1081 CGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTT 1140 

Mill I I M II I I I I II II I II M I I I I I II II I II I I I II I II I I II Mill 

Db 1081 CGGAATATCTACCAGCTTTCCTTCAGACAAAATGCATCAGACAAGGAAATTGTGTGGGTC 1140

Qy 1141 ATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAA 1200 

Ml I Mill Ml M II II I M II I II I I M II I M II II II I I II II II II I I 

Db 1141 ATGAGGATCACTGTGCTTGTGTTCGGAGCATCTGCAACAGCCATGGCTTTGCTGACGAAG 1200 

Qy 1201 ACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAG 1260 

M I II II M II I I I I II II II M II I I II I II II I I MUM I M I II III III 

Db 1201 ACTGTGTATGGGCTCTGGTACCTGAGCTCTGACCTTGTCTACATCATCATCTTCCCACAG 1260 

Qy 1261 CTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTT 1320 

Mill I II II II II II I II I II II II II I I I I II II II II II II II II II 
Db 1261 CTGCTCTGTGTACTCTTCATCAAAGGAACCAACACTTATGGGGCAGTTGCTGGTTATATT 1320 

Qy 1321 TCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATC 1380 

I III II II I I II I II M M I II I I I II I II II II M II I I II II I II III 

Db 1321 TTTGGACTATTCCTGAGAATTACTGGAGGAGAGCCATATCTATACTTGCAGCCCTTAATC 1380 

Qy 1381 TTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAATTTCCATTTAAA 1440 

II I M II I I I I II M II I I II I II II I M II I I I II II II II II I I II I I I 

Db 1381 TTCTACCCTGGTTATTACTCTGACAAGAATGGTATATACAATCAGAGGTTCCCATTTAAA 1440 

Qy 1441 ACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTAT 1500 

MM I II I I M II I I II II I M I II II II I I M I I I M II I II I II M I M 

Db 1441 ACTCTCTCCATGGTTACCTCATTCTTTACCAACATTTGTGTTTCTTATCTAGCCAAGTAT 1500 

Qy 1501 CTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGA 1560 

II II II M II M II M II I II I I II II II II II I I II I II II II II M I II I I II I 
Db 1501 CTATTTGAAAGTGGAACCTTGCCTCCAAAATTAGATGTATTTGATGCTGTTGTCGCAAGG 1560 



Qy 1561 CACAGTGAAGAAAACATGGATAAGACAATTCTTGTCAAAAATGAAAATATTAAATTAGAT 1620 

I I I I I I I I I I I I I I I I I I I Mill 11 I I I II I I I I I I I I I I I I II I I I I I I II 
Db 1561 CACAGTGAAGAGAACATGGACAAGACCATTCTAGTCAGAAATGAAAATATCAAATTAAAT 1620

Qy 1621 GAACTTGCACTTGTGAAGCCACGACAGAGCATGACCCTCAGCTCAACTTTCACCAATAAA 1680 

I I II I M I II I I M II II II I M I I I I I II M I I I I I I I I I I I I I I II I I I I 
Db 1621 GAACTTGCACCTGTGAAACCTCGGCAGAGCCTAACCCTCAGTTCAACTTTCACCAATAAG 1680 

Qy 1681 GAGGCCTTCCTTGATGTTGATTCCAGTCCAGAAGGGTCTGGGACTGAAGATAATTTACAG 1740 

I I I II I I I I II II I II I I I I I I I I I I M II I I I I I I I I I I M II I I I II I I I I II 

Db 1681 GAGGCCCTCCTTGATGTTGATTCCAGTCCGGAGGGGTCTGGGACTGAAGATAACTTACAA 1740 

Qy 1741 TGA 1743 

I I I 

Db 1741 TGA 1743 
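
Note on the summary figures for Results 9 and 10: the reported values appear to be mutually consistent, and the short Python sketch below re-derives them as a sanity check. It rests on assumptions that are inferred from the numbers rather than stated in the report: a perfect score of 1743 (the query length under the identity scoring table), a score of +1 per match and -0.6 per mismatch, and alignments without gaps. The function name `summarize` and the constant `PERFECT_SCORE` are illustrative only.

```python
# Sanity check of the per-result summary lines (ungapped alignments only).
# Assumptions (inferred, not stated in the report): +1 per match,
# -0.6 per mismatch, perfect score equal to the query length (1743).
PERFECT_SCORE = 1743


def summarize(matches, mismatches, mismatch_penalty=0.6):
    """Recompute Score, Query Match % and Best Local Similarity %."""
    aligned = matches + mismatches            # no indels, so every column is one or the other
    score = matches - mismatch_penalty * mismatches
    return {
        "score": round(score, 1),
        "query_match_pct": round(100 * score / PERFECT_SCORE, 1),
        "best_local_similarity_pct": round(100 * matches / aligned, 1),
    }


if __name__ == "__main__":
    print("Result 9: ", summarize(1513, 230))
    print("Result 10:", summarize(1512, 231))
```

Under these assumptions the recomputed values agree with the printed lines (Score 1375 and 1373.4, Query Match 78.9% and 78.8%, Best Local Similarity 86.8% and 86.7%). The sketch does not model gap penalties (Gapop 10.0, Gapext 1.0), so it applies only to results reporting zero indels.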



RESULT 11 
AAD02457 

ID AAD02457 standard; cDNA; 4938 BP. 
XX 

AC AAD02457; 
XX 

DT 24-APR-2001 (first entry) 
XX 

DE Mouse P4P6B1 OMA (obese mice adipocyte) protein encoding cDNA. 
XX 

KW Mouse; OMA protein; obese mice adipocyte; P4P6B1; 

KW fuel metabolism disorder; therapy; obesity; diabetes; gene therapy; 

KW anorectic; antidiabetic; ss. 

XX 

OS Mus sp.

XX 

FH Key Location/Qualifiers 
FT CDS 247..1989

FT /*tag= a

FT /product= "Mouse P4P6B1 OMA protein"

FT misc_feature 988..1342
FT /*tag= b

FT /note= "Portion of original 450 bp PCR fragment"

XX 

PN WO200078950-A2.
XX

PD 28-DEC-2000.
XX

PF 13-JUN-2000; 2000WO-US016217.
XX

PR 22-JUN-1999; 99US-0141515P.
XX 

PA (AMYL-) AMYLIN PHARM INC. 
XX 

PI Sierzega M, Albrandt K; 

XX 

DR WPI; 2001-112322/12. 
DR P-PSDB; AAY72388. 
XX 

PT Novel obese mice adipocyte polypeptides useful in diagnosis and treatment 



PT of disorders of fuel metabolism such as obesity or diabetes. 

XX 

PS Claim 2; Fig 3; 83pp; English. 
XX 

CC The present sequence is mouse P4P6B1 cDNA which encodes OMA (obese mice 

CC adipocyte) protein. The P4P6B1 fragment was generated by RNA 

CC fingerprinting using random primers P4 and P6. OMA is used as a 

CC diagnostic reagent for diagnosing a disorder of fuel metabolism in an 

CC underweight or an overweight individual, by detecting the transcription 

CC level of a gene encoding OMA, which is induced or repressed in an 

CC individual by a factor such as genetic obesity, fasting and refeeding of 

CC a fasted individual. OMA is useful in the generation of antibodies, for 

CC use in pharmaceutical compositions and for studying DNA/protein 

CC interactions. Nucleic acids encoding OMA are involved in gene therapy. An 

CC inhibitor of OMA or an antisense oligonucleotide that inhibits expression 

CC of OMA are useful for treating disorders of fuel metabolism such as 

CC obesity or diabetes 

XX 

SQ Sequence 4938 BP; 1436 A; 1012 C; 976 G; 1514 T; 0 U; 0 Other;
Query Match 78.8%; Score 1373.4; DB 5; Length 4938; 
Best Local Similarity 86.7%; Pred. No. 0; 

Matches 1512; Conservative 0; Mismatches 231; Indels 0; Gaps 0; 

Qy 1 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60

III II I I II I II II I I M I II I II I I I I III I I I I M I I I II II I III
Db 247 ATGTCTTTCCACGTAGAAGGACTGGTAGCTATTATCCTCTTCTACCTCCTTATACTTCTG 306

Qy 61 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 120

II II II II II II I II II I II I I I II I I I I I I II I I I I I I I I M I I II I I II III
Db 307 GTTGGAATATGGGCTGCATGGAAAACCAAAAACAGCGGCAACCCAGAAGAGCGCAGTGAA 366

Qy 121 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 180

II II II I I I I I II I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I II II I
Db 367 GCCATCATAGTCGGGGGCCGTGACATTGGTTTGTTGGTTGGTGGTTTTACCATGACAGCC 426

Qy 181 ACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTAT 240

1111111111111111 II II I I I I II I II I I I II I I I II MM II II I II I
Db 427 ACCTGGGTTGGAGGAGGCTACATCAATGGGACAGCAGAAGCAGTGTATGGGCCAGGTTGT 486

Qy 241 GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 300

II II I II II II M II II I II M I II II II II II I I I Mill I I II I II II II III
Db 487 GGTCTAGCTTGGGCTCAGGCACCCATTGGATATTCTCTGAGTCTAATTTTAGGTGGTCTG 546

Qy 301 TTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAA 360

II II I I I II I M I II I II II I I II II II II II I I II M M I II II Mill II

Db 547 TTTTTTGCGAAACCTATGCGTTCCAAGGGATATGTGACTATGTTAGACCCATTTCAACAG 606

Qy 361 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTC 420

I II I II M I II M II II I I II II II II II I I II I II II I M II I II Mini
Db 607 ATCTATGGAAAGCGCATGGGTGGGCTGCTCTTCATCCCTGCACTGATGGGAGAGATGTTC 666

Qy 421 TGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGAT 480

II II I II II II I M I II II M II M II M II M II I M II II M II I II I M II II
Db 667 TGGGCTGCAGCAATTTTCTCTGCATTAGGGGCCACCATCAGCGTGATCATTGATGTGGAT 726

Qy 481 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG 540



II I I I I II I I I II I II II I I II I I I I I II II III II II II I II M III 
Db 727 GTGAACATATCGGTCATTGTCTCTGCACTCATTGCCATTCTTTATACCCTAGTGGGTGGG 786 

Qy 541 CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGG 600 

I I I I I I M I I II I II II I I I I I I II II I I I II I I M I I II I I I I I MUM 

Db 787 CTCTACTCTGTGGCATATACTGATGTTGTCCAGCTATTCTGCATTTTTATAGGACTGTGG 846

Qy 601 ATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTG 660 

I II M II I II MM! II II I II M I M M I II I M M M M Mill I II II I 

Db 847 ATCAGTGTCCCTTTTGCCCTGTCACATCCTGCAGTCACCGACATCGGATTCACAGCTGTG 906 

Qy 661 CATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGG 720 

Mill II II M I I I M II II II II II I II I I III II M II I M I I III 

Db 907 CATGCTAAATACCAGAGTCCCTGGCTGGGAACCATTGAATCAGTTGAAGTCTACACCTGG 966 

Qy 721 CTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGG 780 

II II II I II M II II I M II M II II I II II II II II II M M M Mill I II II I 

Db 967 CTTGATAATTTTCTGTTATTGATGCTGGGTGGAATCCCATGGCAAGCCTACTTCCAGAGG 1026 

Qy 781 GTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGG 840

II II M I I II I II II II I II M M I M I II II I II II M II I II II I M II I III 

Db 1027 GTCCTCTCTTCATCCTCAGCCACCTATGCTCAGGTACTGTCCTTCCTGGCAGCTTTTGGG 1086 

Qy 841 TGCCTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGG 900 

II M II II M II II I II II II II III M II II II M II II M II M I II 
Db 1087 TGCCTGGTGATGGCTCTACCCGCCATATGCATAGGAGCTATTGGAGCTTCCACAGACTGG 1146

Qy 901 AACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTA 960

I II I M I M I I II III I II II II II II I I II M II II I II II I M II I I I 

Db 1147 AACCAGACTGCCTACGGGTATCCAGATCCCAAGACTAAGGAGGAAGCAGACATGATTCTC 1206

Qy 961 CCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTT 1020 

II II II II II M II I M II M II I II II I II II II II II II M II II II III 

Db 1207 CCGATCGTTCTGCAGTACCTCTGCCCTGTGTACATCTCCTTCTTTGGGCTTGGTGCTGTT 1266 

Qy 1021 TCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCA 1080 

M I M I II I I I II II II II I M M I II I M MM M I I II I II M II I I 

Db 1267 TCAGCTGCTGTCATGTCCTCAGCTGACTCGTCCATCCTGTCGGCGAGTTCTATGTTTGCT 1326

Qy 1081 CGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTT 1140

II II I I II II I M II II M M II II II I II II M II Mill Mill II I II I I 

Db 1327 CGGAATATCTACCAGCTTTCCTTCAGACAAAATGCATCAGACAAGGAAATTGTGTGGGTC 1386 

Qy 1141 ATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAA 1200 

III I I I II I III II II II I II I II II I II II I I II I II II II II I II I II M I 

Db 1387 ATGAGGATCACTGTGCTTGTGTTCGGAGCATCTGCAACAGCCATGGCTTTGCTGACGAAG 1446 

Qy 1201 ACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAG 1260 

II M II I II II II II II II II II II II II II M II I M II II I I II II II I III 
Db 1447 ACTGTGTATGGGCTCTGGTACCTGAGCTCTGACCTTGTCTACATCATCATCTTCCCACAG 1506 

Qy 1261 CTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTT 1320 

I I II I I I I II I II II I I II II I II II I II I I I I II II I II M II I I II II 

Db 1507 CTGCTCTGTGTACTCTTCATCAAAGGAACCAACACTTATGGGGCAGTTGCTGGTTATATT 1566 

Qy 1321 TCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATC 1380 

I III II II II II II II I II II II II II M II I II II II I I II II II I III 



Db 1567 TTTGGACTATTCCTGAGAATTACTGGAGGAGAGCCATATCTATACTTGCAGCCCTTAATC 1626 

Qy 1381 TTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAATTTCCATTTAAA 1440

I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I II I I I I I I I I I 

Db 1627 TTCTACCCTGGTTATTACTCTGACAAGAATGGTATATACAATCAGAGGTTCCCATTTAAA 1686 

Qy 1441 ACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTAT 1500 

II II I I II II II II I I I I I I I I I I II II I I I I I I II I I I I I I I I I I I I I I I 

Db 1687 ACTCTCTCCATGGTTACCTCATTCTTTACCAACATTTGTGTTTCTTATCTAGCCAAGTAT 1746

Qy 1501 CTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGA 1560 

I I I M I I II M I I I I I I I I I I II II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1747 CTATTTGAAAGTGGAACCTTGCCTCCAAAATTAGATGTATTTGATGCTGTTGTCGCAAGG 1806 

Qy 1561 CACAGTGAAGAAAACATGGATAAGACAATTCTTGTCAAAAATGAAAATATTAAATTAGAT 1620

II M M II II I I I I I I I I I I! I II I I II I I I I I I I I II I I I I I I I I I I I I I II 

Db 1807 CACAGTGAAGAGAACATGGACAAGACCATTCTAGTCAGAAATGAAAATATCAAATTAAAT 1866

Qy 1621 GAACTTGCACTTGTGAAGCCACGACAGAGCATGACCCTCAGCTCAACTTTCACCAATAAA 1680

M I I I I I I I I II I I I I II II MINI I I M M I I I I I I I I I I I I I I I I I I I I 
Db 1867 GAACTTGCACCTGTGAAACCTCGGCAGAGCCTAACCCTCAGTTCAACTTTCACCAATAAG 1926 

Qy 1681 GAGGCCTTCCTTGATGTTGATTCCAGTCCAGAAGGGTCTGGGACTGAAGATAATTTACAG 1740

I I I I I I I I I I I I I I I M I I I I I I I I I I I II I I I II I I I I I M I I I I I I M Mill 

Db 1927 GAGGCCCTCCTTGATGTTGATTCCAGTCCGGAGGGGTCTGGGACTGAAGATAACTTACAA 1986 

Qy 1741 TGA 1743 

I I I 

Db 1987 TGA 1989 



RESULT 12 
AAH49206 

ID AAH49206 standard; DNA; 8760 BP. 
XX 

AC AAH49206; 
XX 

DT 26-NOV-2001 (first entry) 
XX 

DE Human CHOT exons 6, 7, 8 and 3' UTR region DNA.
XX 

KW CHOT; human; choline transporter; chromosome 2q11-13; nootropic;

KW neuroprotective; gene therapy; antisense therapy; degenerative disease; 

KW cognitive disorder; Alzheimer's disease; ds.

XX 

OS Homo sapiens. 



XX 

FH Key Location/Qualifiers 

FT CDS 1..4853
FT /*tag= a
FT /product= "CHOT"
FT /note= "This sequence is interrupted by introns"
FT exon 41..194
FT /*tag= b
FT /number= 6
FT intron 195..2456
FT /*tag= c
FT /number= 6
FT exon 2455..2674
FT /*tag= d
FT /number= 7
FT intron 2675..4223
FT /*tag= e
FT /number= 7
FT exon 4224..4853
FT /*tag= f
FT /number= 8
FT 3'UTR 4854..8760
FT /*tag= g
XX
PN DE10009055-A1.
XX
PD 30-AUG-2001.
XX
PF 28-FEB-2000; 2000DE-01009055.
XX
PR 28-FEB-2000; 2000DE-01009055.
XX
PA (BRUE/) BRUESS M.
PA (BOEN/) BOENISCH H.
XX
PI Bruess M, Boenisch H;
XX
DR WPI; 2001-590709/67.
XX
PT A new gene encoding human choline transporter, designated hCHOT is
PT located on chromosome 2q11-13 and is useful to treat degenerative
PT disorders such as Alzheimer's disease.
XX
PS Disclosure; Page 9-11; 12pp; German.
XX
CC This invention describes a novel gene encoding human choline transporter,
CC designated hCHOT which is located on chromosome 2q11-13. The products of
CC the invention have nootropic and neuroprotective activity and can be used
CC for gene or antisense therapy. (I) is used to treat degenerative disease,
CC particularly cognitive disorders such as Alzheimer's disease. Sense and
CC antisense oligonucleotides derived from the gene may be used in
CC diagnostics and other techniques. This sequence represents exons 6-8 and
CC the 3' UTR fragment encoding the human CHOT protein described in the
CC invention
XX
SQ Sequence 8760 BP; 2727 A; 1619 C; 1565 G; 2849 T; 0 U; 0 Other;



Query Match 36.2%; Score 630.8; DB 5; Length 8760;

Best Local Similarity 99.7%; Pred. No. 1.8e-172;

Matches 632; Conservative 0; Mismatches 2; Indels 0; Gaps 0;



Qy 1110 AAATGCTTCGGACAAAGAAATCGTTTGGGTTATGCGAATCACAGTGTTTGTGTTTGGAGC 1169 

I I I I I I I I I 11 I I I I I M I I I I I I I I I I I I I M I M I I I I I I I I I M I I I I I I I I I I I 

Db 4220 ACAGGCTTCGGACAAAGAAATCGTTTGGGTTATGCGAATCACAGTGTTTGTGTTTGGAGC 4279 

Qy 1170 ATCTGCAACAGCCATGGCCTTGCTGACGAAAACTGTGTATGGGCTCTGGTACCTCAGTTC 1229 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I M I I I I I I I I I I M M I I 

Db 4280 ATCTGCAACAGCCATGGCCTTGCTGACGAAAACTGTGTATGGGCTCTGGTACCTCAGTTC 4339 



Qy 1230 TGACCTTGTTTACATCGTTATCTTCCCCCAGCTGCTTTGTGTACTCTTTGTTAAGGGAAC 1289

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 4340 TGACCTTGTTTACATCGTTATCTTCCCCCAGCTGCTTTGTGTACTCTTTGTTAAGGGAAC 4399

Qy 1290 CAACACCTATGGGGCCGTGGCAGGTTATGTTTCTGGCCTCTTCCTGAGAATAACTGGAGG 1349

I I I I I I I I I I I I M I I I I I I M I I I I M I I I I I M I I I I I I I I I I I I M I I I I I I M I I I 

Db 4400 CAACACCTATGGGGCCGTGGCAGGTTATGTTTCTGGCCTCTTCCTGAGAATAACTGGAGG 4459 

Qy 1350 GGAGCCATATCTGTATCTTCAGCCCTTGATCTTCTACCCTGGCTATTACCCTGATGATAA 1409 

I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 4460 GGAGCCATATCTGTATCTTCAGCCCTTGATCTTCTACCCTGGCTATTACCCTGATGATAA 4519 

Qy 1410 TGGTATATATAATCAGAAATTTCCATTTAAAACACTTGCCATGGTTACATCATTCTTAAC 1469

I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I 
Db 4520 TGGTATATATAATCAGAAATTTCCATTTAAAACACTTGCCATGGTTACATCATTCTTAAC 4579 

Qy 1470 CAACATTTGCATCTCCTATCTAGCCAAGTATCTATTTGAAAGTGGAACCTTGCCACCTAA 1529 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I 
Db 4580 CAACATTTGCATCTCCTATCTAGCCAAGTATCTATTTGAAAGTGGAACCTTGCCACCTAA 4639 

Qy 1530 ATTAGATGTATTTGATGCTGTTGTTGCAAGACACAGTGAAGAAAACATGGATAAGACAAT 1589 

M I I M I I I I I I I I I M I I I I I I I I I I I I I I I M I I I I I I I I M I I I I I I I I I I I I I I I I 

Db 4640 ATTAGATGTATTTGATGCTGTTGTTGCAAGACACAGTGAAGAAAACATGGATAAGACAAT 4699 

Qy 1590 TCTTGTCAAAAATGAAAATATTAAATTAGATGAACTTGCACTTGTGAAGCCACGACAGAG 1649

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I 
Db 4700 TCTTGTCAAAAATGAAAATATTAAATTAGATGAACTTGCACTTGTGAAGCCACGACAGAG 4759 

Qy 1650 CATGACCCTCAGCTCAACTTTCACCAATAAAGAGGCCTTCCTTGATGTTGATTCCAGTCC 1709 

I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 4760 CATGACCCTCAGCTCAACTTTCACCAATAAAGAGGCCTTCCTTGATGTTGATTCCAGTCC 4819 

Qy 1710 AGAAGGGTCTGGGACTGAAGATAATTTACAGTGA 1743 

M I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I 
Db 4820 AGAAGGGTCTGGGACTGAAGATAATTTACAGTGA 4853 
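
Note on the match lines: the rows of match symbols between each Qy and Db line did not survive extraction cleanly, but for ungapped alignments they can be regenerated from the two sequence lines alone. The Python sketch below shows one way to do so, using the first block of the Result 12 alignment (Qy 1110-1169 against Db 4220-4279) as input. The helper name `compare` and the use of `|` as the match symbol are illustrative choices, not the search program's own output format.

```python
# Rebuild a match line for one ungapped alignment block by comparing
# the query and database rows position by position.

def compare(qy, db):
    """Return (match_line, matches, mismatches) for two equal-length rows."""
    if len(qy) != len(db):
        raise ValueError("rows must have equal length (ungapped block expected)")
    marks = "".join("|" if a == b else " " for a, b in zip(qy, db))
    matches = marks.count("|")
    return marks, matches, len(qy) - matches


# First block of the Result 12 alignment, copied from the report.
QY = "AAATGCTTCGGACAAAGAAATCGTTTGGGTTATGCGAATCACAGTGTTTGTGTTTGGAGC"
DB = "ACAGGCTTCGGACAAAGAAATCGTTTGGGTTATGCGAATCACAGTGTTTGTGTTTGGAGC"

if __name__ == "__main__":
    marks, matches, mismatches = compare(QY, DB)
    print("Qy 1110", QY, 1169)
    print("       ", marks)
    print("Db 4220", DB, 4279)
    print(f"matches={matches} mismatches={mismatches}")
```

For this block the comparison finds two mismatching positions, both within the first four bases, which agrees with the "Mismatches 2" reported for the alignment as a whole. Alignments containing indels would need gap characters in the rows before this comparison is meaningful.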



RESULT 13 

ADD50656 

ID ADD50656 standard; DNA; 119040 BP. 
XX 

AC ADD50656; 
XX 

DT 15-JAN-2004 (first entry) 
XX 

DE BAC sequence containing hCHT gene.

XX 

KW Human; high-affinity choline transporter; hCHT; chromosome 2q12;

KW cholinergic function; Parkinson's disease; Huntington's disease; 

KW Alzheimer's disease; schizophrenia; dysautonomia; myasthenia gravis;

KW brain; cholinergic signalling; antiparkinsonian; anticonvulsant; 

KW nootropic; neuroprotective; neuroleptic; bacterial artificial chromosome; 

KW BAC; ds.

XX 

OS Homo sapiens. 
XX 



PN US2003114399-A1. 
XX 

PD 19-JUN-2003. 
XX 

PF 23-JUL-2001; 2001US-00911077 . 
XX 

PR 23-JUL-2001; 2001US-00911077 . 
XX 

PA (BLAK/) BLAKELY R D. 

PA (APPA/) APPARSUNDARAM S. 

PA (FERG/) FERGUSON S. 

XX 

PI Blakely RD, Apparsundaram S, Ferguson S; 
XX 

DR WPI; 2003-810914/76. 
XX 

PT Novel isolated polynucleotide encoding human or mouse high affinity 

PT choline transporter polypeptide, useful in gene therapy to increase 

PT cholinergic function in a cell of a patient suffering from Alzheimer's 

PT disease. 
XX 

PS Example 3; SEQ ID NO 19; 74pp; English. 
XX 

CC The present invention relates to the isolation of polynucleotide 

CC sequences encoding human and mouse high-affinity choline transporter 

CC (hCHT and mCHT respectively), and the proteins they encode. The gene 

CC encoding hCHT is located on chromosome 2q12. The polynucleotide sequence 

CC encoding hCHT is useful for expressing hCHT recombinantly. The hCHT 

CC polynucleotide sequence when delivered to a cell, increases cholinergic 

CC function in the cell that is in a patient having Parkinson's disease, 

CC Huntington's disease, Alzheimer's disease, schizophrenia, dysautonomia or 

CC myasthenia gravis. The hCHT antibody is useful for controlling 

CC transporter CHT proteins to the brain, and for treating the above 

CC mentioned diseases. The antibody is also useful for diagnosing the above 

CC mentioned disorders and to detect the influence of cholinergic 

CC signalling. The present sequence represents a bacterial artificial 

CC chromosome (BAC) sequence containing the hCHT gene. Note: The sequence 

CC data for this patent was obtained in electronic format directly from the 

CC USPTO web site at seqdata.uspto.gov. 

XX 

SQ Sequence 119040 BP; 37072 A; 22876 C; 21708 G; 36882 T; 0 U; 502 Other; 



Query Match 36.2%; Score 630.8; DB 9; Length 119040; 

Best Local Similarity 99.7%; Pred. No. 7.1e-172; 

Matches 632; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 
Qy 1110 AAATGCTTCGGACAAAGAAATCGTTTGGGTTATGCGAATCACAGTGTTTGTGTTTGGAGC 1169 

I I I I I I I M I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 30755 ACAGGCTTCGGACAAAGAAATCGTTTGGGTTATGCGAATCACAGTGTTTGTGTTTGGAGC 30814 

Qy 1170 ATCTGCAACAGCCATGGCCTTGCTGACGAAAACTGTGTATGGGCTCTGGTACCTCAGTTC 1229 

I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 30815 ATCTGCAACAGCCATGGCCTTGCTGACGAAAACTGTGTATGGGCTCTGGTACCTCAGTTC 30874 

Qy 1230 TGACCTTGTTTACATCGTTATCTTCCCCCAGCTGCTTTGTGTACTCTTTGTTAAGGGAAC 1289 

I I I I I I M I I I I I M I M I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 30875 TGACCTTGTTTACATCGTTATCTTCCCCCAGCTGCTTTGTGTACTCTTTGTTAAGGGAAC 30934 



Qy 1290 CAACACCTATGGGGCCGTGGCAGGTTATGTTTCTGGCCTCTTCCTGAGAATAACTGGAGG 1349 

I I I I I I I M I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I M I 

Db 30935 CAACACCTATGGGGCCGTGGCAGGTTATGTTTCTGGCCTCTTCCTGAGAATAACTGGAGG 30994 

Qy 1350 GGAGCCATATCTGTATCTTCAGCCCTTGATCTTCTACCCTGGCTATTACCCTGATGATAA 1409 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 30995 GGAGCCATATCTGTATCTTCAGCCCTTGATCTTCTACCCTGGCTATTACCCTGATGATAA 31054 

Qy 1410 TGGTATATATAATCAGAAATTTCCATTTAAAACACTTGCCATGGTTACATCATTCTTAAC 1469 

I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 31055 TGGTATATATAATCAGAAATTTCCATTTAAAACACTTGCCATGGTTACATCATTCTTAAC 31114 

Qy 1470 CAACATTTGCATCTCCTATCTAGCCAAGTATCTATTTGAAAGTGGAACCTTGCCACCTAA 1529 

I I I I I I I I I I I I I I I I I I I M I I I I I M I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I 
Db 31115 CAACATTTGCATCTCCTATCTAGCCAAGTATCTATTTGAAAGTGGAACCTTGCCACCTAA 31174 

Qy 1530 ATTAGATGTATTTGATGCTGTTGTTGCAAGACACAGTGAAGAAAACATGGATAAGACAAT 1589 

I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I 
Db 31175 ATTAGATGTATTTGATGCTGTTGTTGCAAGACACAGTGAAGAAAACATGGATAAGACAAT 31234 

Qy 1590 TCTTGTCAAAAATGAAAATATTAAATTAGATGAACTTGCACTTGTGAAGCCACGACAGAG 1649 

I M I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 31235 TCTTGTCAAAAATGAAAATATTAAATTAGATGAACTTGCACTTGTGAAGCCACGACAGAG 31294 

Qy 1650 CATGACCCTCAGCTCAACTTTCACCAATAAAGAGGCCTTCCTTGATGTTGATTCCAGTCC 1709 

I I I I I I I M I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 31295 CATGACCCTCAGCTCAACTTTCACCAATAAAGAGGCCTTCCTTGATGTTGATTCCAGTCC 31354 

Qy 1710 AGAAGGGTCTGGGACTGAAGATAATTTACAGTGA 1743 

I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I 
Db 31355 AGAAGGGTCTGGGACTGAAGATAATTTACAGTGA 31388 



RESULT 14 


ADD50651 


ID ADD50651 standard; DNA; 142299 BP. 
XX 

AC ADD50651; 
XX 

DT 15-JAN-2004 (first entry) 
XX 

DE BAC sequence #2 containing hCHT DNA. 
XX 

KW Human; high-affinity choline transporter; hCHT; chromosome 2q12; 
KW cholinergic function; Parkinson's disease; Huntington's disease; 
KW Alzheimer's disease; schizophrenia; dysautonomia; myasthenia gravis; 
KW brain; cholinergic signalling; antiparkinsonian; anticonvulsant; 
KW nootropic; neuroprotective; neuroleptic; bacterial artificial chromosome; 
KW BAC; ds. 
XX 

OS Homo sapiens. 
XX 

PN US2003114399-A1. 
XX 

PD 19-JUN-2003. 
XX 





PF 23-JUL-2001; 2001US-00911077 . 
XX 

PR 23-JUL-2001; 2001US-00911077 . 
XX 

PA (BLAK/) BLAKELY R D. 

PA (APPA/) APPARSUNDARAM S. 

PA (FERG/) FERGUSON S. 

XX 

PI Blakely RD, Apparsundaram S, Ferguson S; 
XX 

DR WPI; 2003-810914/76. 
XX 

PT Novel isolated polynucleotide encoding human or mouse high affinity 

PT choline transporter polypeptide, useful in gene therapy to increase 

PT cholinergic function in a cell of a patient suffering from Alzheimer's 

PT disease. 
XX 

PS Example 1; SEQ ID NO 14; 74pp; English. 
XX 

CC The present invention relates to the isolation of polynucleotide 

CC sequences encoding human and mouse high-affinity choline transporter 

CC (hCHT and mCHT respectively), and the proteins they encode. The gene 

CC encoding hCHT is located on chromosome 2q12. The polynucleotide sequence 

CC encoding hCHT is useful for expressing hCHT recombinantly. The hCHT 

CC polynucleotide sequence when delivered to a cell, increases cholinergic 

CC function in the cell that is in a patient having Parkinson's disease, 

CC Huntington's disease, Alzheimer's disease, schizophrenia, dysautonomia or 

CC myasthenia gravis. The hCHT antibody is useful for controlling 

CC transporter CHT proteins to the brain, and for treating the above 

CC mentioned diseases. The antibody is also useful for diagnosing the above 

CC mentioned disorders and to detect the influence of cholinergic 

CC signalling. The present sequence represents a bacterial artificial 

CC chromosome (BAC) sequence containing hCHT DNA. Note: The sequence data 

CC for this patent was obtained in electronic format directly from the USPTO 

CC web site at seqdata.uspto.gov. 

XX 

SQ Sequence 142299 BP; 42714 A; 27041 C; 26747 G; 44494 T; 0 U; 1303 Other; 

Query Match 36.2%; Score 630.8; DB 9; Length 142299; 
Best Local Similarity 99.7%; Pred. No. 7.8e-172; 

Matches 632; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 

Qy 1110 AAATGCTTCGGACAAAGAAATCGTTTGGGTTATGCGAATCACAGTGTTTGTGTTTGGAGC 1169 

I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 94673 ACAGGCTTCGGACAAAGAAATCGTTTGGGTTATGCGAATCACAGTGTTTGTGTTTGGAGC 94732 

Qy 1170 ATCTGCAACAGCCATGGCCTTGCTGACGAAAACTGTGTATGGGCTCTGGTACCTCAGTTC 1229 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M M I M M I I I.I I 

Db 94733 ATCTGCAACAGCCATGGCCTTGCTGACGAAAACTGTGTATGGGCTCTGGTACCTCAGTTC 94792 

Qy 1230 TGACCTTGTTTACATCGTTATCTTCCCCCAGCTGCTTTGTGTACTCTTTGTTAAGGGAAC 1289 

I I I I I I I I II II M II II I I I I I I M II I I I M I I II I I I I II II i I II II II M I II II 

Db 94793 TGACCTTGTTTACATCGTTATCTTCCCCCAGCTGCTTTGTGTACTCTTTGTTAAGGGAAC 94852 

Qy 1290 CAACACCTATGGGGCCGTGGCAGGTTATGTTTCTGGCCTCTTCCTGAGAATAACTGGAGG 1349 

I I I I I I I I I I I I I II I II I I I I I I I M I I I I I I I I I I II I I I II I I I I I I I I I I M I I M 

Db 94853 CAACACCTATGGGGCCGTGGCAGGTTATGTTTCTGGCCTCTTCCTGAGAATAACTGGAGG 94912 



Qy 1350 GGAGCCATATCTGTATCTTCAGCCCTTGATCTTCTACCCTGGCTATTACCCTGATGATAA 1409 

I I I I I I I I M I M I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I M I I I I 

Db 94913 GGAGCCATATCTGTATCTTCAGCCCTTGATCTTCTACCCTGGCTATTACCCTGATGATAA 94972 

Qy 1410 TGGTATATATAATCAGAAATTTCCATTTAAAACACTTGCCATGGTTACATCATTCTTAAC 1469 

I I I I I I I I I I I I I I I I I I I I I M I M I M I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I 

Db 94973 TGGTATATATAATCAGAAATTTCCATTTAAAACACTTGCCATGGTTACATCATTCTTAAC 95032 

Qy 1470 CAACATTTGCATCTCCTATCTAGCCAAGTATCTATTTGAAAGTGGAACCTTGCCACCTAA 1529 

I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I 
Db 95033 CAACATTTGCATCTCCTATCTAGCCAAGTATCTATTTGAAAGTGGAACCTTGCCACCTAA 95092 

Qy 1530 ATTAGATGTATTTGATGCTGTTGTTGCAAGACACAGTGAAGAAAACATGGATAAGACAAT 1589 

I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 95093 ATTAGATGTATTTGATGCTGTTGTTGCAAGACACAGTGAAGAAAACATGGATAAGACAAT 95152 

Qy 1590 TCTTGTCAAAAATGAAAATATTAAATTAGATGAACTTGCACTTGTGAAGCCACGACAGAG 1649 

I I I I I I I I I I I M M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 95153 TCTTGTCAAAAATGAAAATATTAAATTAGATGAACTTGCACTTGTGAAGCCACGACAGAG 95212 

Qy 1650 CATGACCCTCAGCTCAACTTTCACCAATAAAGAGGCCTTCCTTGATGTTGATTCCAGTCC 1709 

I I I I I I I I I M M I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 95213 CATGACCCTCAGCTCAACTTTCACCAATAAAGAGGCCTTCCTTGATGTTGATTCCAGTCC 95272 

Qy 1710 AGAAGGGTCTGGGACTGAAGATAATTTACAGTGA 1743 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I 
Db 95273 AGAAGGGTCTGGGACTGAAGATAATTTACAGTGA 95306 



RESULT 15 

AAF81710 

ID AAF81710 standard; cDNA; 1731 BP. 
XX 

AC AAF81710; 
XX 

DT 01-JUN-2001 (first entry) 
XX 

DE C. elegans high affinity choline transporter protein encoding cDNA. 

XX 

KW High affinity choline transporter; cho-1; Alzheimer's disease; diagnosis; 

KW ss. 

XX 

OS Caenorhabditis elegans. 
XX 

FH Key Location/Qualifiers 

FT CDS 1..1731 

FT /*tag= a 

FT /product= "high affinity choline transporter" 

XX 

PN WO200116315-A1. 

XX 

PD 08-MAR-2001. 
XX 

PF 18-AUG-2000; 2000WO-JP005545. 
XX 

PR 27-AUG-1999; 99JP-00240642. 

PR 27-DEC-1999; 99JP-00368991. 
XX 

PA (NISC-) JAPAN SCI & TECHNOLOGY CORP. 
XX 

PI Haga T, Okuda T; 
XX 

DR WPI; 2001-226688/23. 

DR P-PSDB; AAB74663. 
XX 

PT New rat and human spinal cord high affinity choline transporters, useful 

PT in diagnosis of Alzheimer's disease and screening promoters as drugs for 

PT treating Alzheimer's disease. 
XX 

PS Claim 3; Page 57-62; 90pp; Japanese. 
XX 

CC The present sequence encodes a Caenorhabditis elegans high affinity 

CC choline transporter protein designated cho-1. The cho-1 protein has 

CC nootropic and neuroprotective activities. The cho-1 polynucleotide and 

CC protein can be used for the diagnosis of diseases related to the 

CC expression of cho-1 by comparing the cho-1 polynucleotide sequence in a 

CC sample to that of a control. Drug compositions containing the cho-1 

CC protein or expression promoters or inhibitors of cho-1 are useful for 

CC treating disorders characterised by abnormal levels of cho-1, such as 

CC Alzheimer's disease. 
XX 

SQ Sequence 1731 BP; 428 A; 373 C; 427 G; 503 T; 0 U; 0 Other; 

Query Match 20.9%; Score 363.8; DB 4; Length 1731; 

Best Local Similarity 55.1%; Pred. No. 5.5e-95; 

Matches 862; Conservative 0; Mismatches 637; Indels 66; Gaps 5; 

Qy 19 GGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTGGTTGGAATATGGGCTGCC 78 

II I I II I I I I I I I I I I I II M I I I I I I I I I I I I I I II I I 
Db 16 GGTATCGTGGCCATTGTGTTCTTCTACGTGCTCATTCTTGTCGTTGGAATATGGGCGGGT 75 

Qy 79 TGGAGAACCAAAA------------ACAGTGGCAGCGCAGAAGAGCGCAGCGAAGCCATC 126 

I I I I I II I I I I I I I I I I I I 

Db 76 AGAAAATCGAAAAGTTCAAAAGAGCTTGAATCAGAAGCCGGCGCGGCGACGGAAGAGGTG 135 

Qy 127 ATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCTACCTGG 186 



Db 136 ATGTTAGCTGGGAGAAACATCGGAACTCTTGTCGGAATTTTCACAATGACTGCCACGTGG 195 

Qy 187 GTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTATGGCCTA 246 

Db 196 GTTGGCGGTGCTTATATCAATGGAACCGCCGAGGCTCTGTATAATGGAGGT------CTC 249 

Qy 247 GCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTGTTCTTT 306 

Db 250 CTTGGATGTCAGGCTCCAGTTGGATATGCAATTTCCCTTGTTATGGGAGGACTACTTTTC 309 

Qy 307 GCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAAATCTAT 366 

Db 310 GCAAAGAAAATGCGAGAAGAAGGATATATTACAATGCTCGATCCTTTTCAGCACAAATAT 369 

Qy 367 GGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTCTGGGCT 426 

II I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I 

Db 370 GGCCAACGJ^ATCGGTGGCTTGATGTATGTTCCAGCACTTCTTGGTGAAACATTCTGGACA 429 



Qy 427 GCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGATATGCAC 486 

I I I I I I I I I I I I I I I I I I I I I I I I I II I II 1 II I 

Db 430 GCAGCCATTCTTTCGGCACTTGGTGCAACACTGTCGGTAATTCTTGGAATCGACATGAAT 489 

Qy 487 ATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGGCTCTAT 546 

I I II I I I I I I I MINI I I I I I II I I I II I I I I 

Db 490 GCATCAGTGACCCTGTCGGCCTGTATTGCCGTATTCTACACATTCACCGGTGGATACTAT 549 

Qy 547 TCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGGATCAGC 606 

I I I M I I M I I M I I M M I I M I I M I I I M I I M I I I I I I I I M 

Db 550 GCAGTCGCGTACACTGACGTCGTTCAACTATTTTGCATTTTCGTCGGTTTGTGGGTTTGC 609 

Qy 607 GTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTGCATGCC 666 

MM MM I II M I II II Mill 

Db 610 GTGCCGGCGGCTATGGTGCATGATGGTGCGAAGGATATTTCCAGGAATGCAG 661 

Qy 667 AAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGGCTTGAT 726 

I I I I I I I I Ml I II I I M II 

Db 662 GCGACTGGATTGGAGAGATTGGAGGATTCAAAGAAACATCTCTCTGGATTGAT 714 

Qy 727 AGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGGGTTCTC 786 

I III I I I I II M II I I II II II I I I II I M II M I II I II 

Db 715 TGCATGCTTCTCCTTGTCTTTGGAGGAATTCCATGGCAAGTGTACTTCCAAAGAGTTCTC 774 

Qy 787 TCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGGTGCCTG 846 

MM III II II I II I I I I II II I I I I I I M I 

Db 775 TCCTCAAAAACTGCTCATGGAGCACAGACGTTGTCGTTTGTGGCGGGCGTCGGATGCATT 834 

Qy 847 GTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGGAACCAG 906 

I II II I M II I I I M II II II M II I II II M I I 

Db 835 CTCATGGCGATTCCACCAGCGTTGATCGGTGCAATTGCCAGGAACACAGACTGGAGAATG 894 

Qy 907 ACTGCATATGGGCTTCC------------------AGATCCCAAGACTACAGAAGAGGCA 948 

II II II 1 I MM I II I II I 

Db 895 ACTGATTATTCCCCATGGAACAATGGAACTAAGGTCGAATCGATTCCACCGGATAAGAGA 954 

Qy 949 GACATGATTTTACCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGT 1008 

I II II I III Ml II II II II I II I I III MM 

Db 955 AACATGGTGGTCCCGTTGGTATTCCAGTATCTTACGCCAAGATGGGTCGCCTTTATTGGA 1014 

Qy 1009 CTTGGTGCAGTTTCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGT 1068 

M M Mill M I II I 11 M II II I II I II II I I II II I I M M M 

Db 1015 CTCGGCGCAGTGTCGGCTGCTGTAATGTCATCTGCAGATTCATCTGTACTATCAGCAGCA 1074 

Qy 1069 TCCATGTTTGCACGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAA 1128 

II II II II II I M II M I MM I I II II M II II II II I 

Db 1075 TCAATGTTTGCTCACAACATCTGGAAGCTCACAATTCGCCCTCACGCGTCTGAAAAAGAA 1134 

Qy 1129 ATCGTTTGGGTTATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCC 1188 

I I II II II II I I I I II I I II I II II II I I 

Db 1135 GTGATAATTGTGATGAGAATAGCCATCATCTGTGTTGGTATCATGGCT^CCATCATGGCA 1194 

Qy 1189 TTGCTGACGAAAACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTT 1248 

I I III M II I M II II II II I Mill M M II II I 

Db 1195 CTTACCATTCAATCCATCTATGGGCTTTGGTATCTTTGTGCAGATTTGGTCTACGTCATA 1254 



Qy 1249 ATCTTCCCCCAGCTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTG 1308 

I M I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I II 

Db 1255 CTCTTCCCTCAACTATTATGTGTTGTATATATGCCACGTAGCAATACGTATGGCTCATTG 1314 

Qy 1309 GCAGGTTATGTTTCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTT 1368 

II II I II I MM III I I I I I II I I I I M I I III M 

Db 1315 GCTGGCTATGCAGTCGGTCTTGTGCTCCGTTTGATTGGAGGCGAGCCACTTGTATCGCTG 1374 

Qy 1369 CAGCCCTTGATCTTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAA 1428 

I I I I I I MM III III MM 
Db 1375 CCAGCGTTCTTCCATTATCCAATGTATACGGATGGGG---------------TACAGTAT 1419 

Qy 1429 TTTCCATTTAAAACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTAT 1488 

I I I I I I I I III I M I II I I I II I I I I I III 

Db 1420 TTCCCATTCAGGACAACTGCTATGTTATCTTCAATGGCTACTATCTACATTGTATCAATA 1479 

Qy 1489 CTAGCCAAGTATCTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCT 1548 

III I I I I I I I II III I II I I I II I I I I I I I I I 

Db 1480 CAATCGGAGAAGCTGTTCAAATCGGGACGTTTGTCTCCGGAGTGGGACGTAATGGGTTGT 1539 

Qy 1549 GTTGT 1553 

I I I I 

Db 1540 GTAGT 1544 



Search completed: September 28, 2004, 21:08:37 
Job time : 1015 secs 



GenCore version 5.1-6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 
Run on: September 28, 2004, 20:43:21 ; Search time 151 Seconds 
(without alignments) 
6405.824 Million cell updates/sec 



Title: US-10-069-541-5 

Perfect score: 1743 
Sequence: 

Scoring table: 
ALIGNMENTS 



RESULT 1 
US-09-657-252-1 

Sequence 1, Application US/09657252 
Patent No. 6500643 
GENERAL INFORMATION: 
APPLICANT: Wu, Dong-Hai 
APPLICANT: Gu, Yunrong 
APPLICANT: Millard, William 

TITLE OF INVENTION: High Affinity Choline Transporter cDNA 

FILE REFERENCE: MBHBOO-639 
CURRENT APPLICATION NUMBER: US/09/657,252 
CURRENT FILING DATE: 2000-09-07 
NUMBER OF SEQ ID NOS: 6 
SOFTWARE: PatentIn version 3.0 
SEQ ID NO 1 
LENGTH: 1743 
TYPE: DNA 



ORGANISM: Homo sapiens 
FEATURE: 
NAME/KEY: CDS 
LOCATION: (1)..(1743) 
US-09-657-252-1 

Query Match 99.7%; Score 1738.2; DB 4; Length 1743; 

Best Local Similarity 99.8%; Pred. No. 0; 

Matches 1740; Conservative 0; Mismatches 3; Indels 0; Gaps 0; 



Qy 1 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60 
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60 

Qy 61 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 120 
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 61 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 120 

Qy 121 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 180 
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 121 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 180 

Qy 181 ACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTAT 240 
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 181 ACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTAT 240 

Qy 241 GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 300 
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 241 GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 300 



Qy 301 TTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAA 360 
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 301 TTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAA 360 

Qy 361 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTC 420 
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 361 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTC 420 

Qy 421 TGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGAT 480 
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 421 TGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGAT 480 

Qy 481 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG 540 
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 481 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG 540 

Qy 541 CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGG 600 
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 541 CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGG 600 

Qy 601 ATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTG 660 
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 601 ATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTG 660 

Qy 661 CATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGG 720 
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 661 CATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGG 720 



Qy 721 CTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGG 780 
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 721 CTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGG 780 

Qy 781 GTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGG 840 
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 781 GTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGG 840 

Qy 841 TGCCTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGG 900 
||||||||||||||||||||||||||||||||||||||||||||||| || |||||||||
Db 841 TGCCTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCCTCCACAGACTGG 900 

Qy 901 AACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTA 960 
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 901 AACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTA 960 



Qy 961 CCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTT 1020 
I Mil I MM III III I M 1 I M I M I 1 I I I M I I 1 I I I I I I M i M I It I I I I I 
Db 961 C(yjvTTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTT 1020 

Qy 1021 TCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCA 1080 
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1021 TCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCA 1080 

Qy 1081 CGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTT 1140 
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1081 CGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTT 1140 

Qy 1141 ATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAA 1200 
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1141 ATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAA 1200 

Qy 1201 ACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAG 1260 
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1201 ACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAG 1260 

Qy 1261 CTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTT 1320 
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1261 CTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTT 1320 

Qy 1321 TCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATC 1380 
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1321 TCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATC 1380 

Qy 1381 TTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAATTTCCATTTAAA 1440 
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1381 TTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAATTTCCATTTAAA 1440 

Qy 1441 ACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTAT 1500 
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1441 ACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTAT 1500 

Qy 1501 CTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGA 1560 
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1501 CTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGA 1560 



Qy 1561 CACAGTGAAGAAAACATGGATAAGACAATTCTTGTCAAAAATGAAAATATTAAATTAGAT 1620 

lYiiMIIIIIIIIMIIIMIIMlllllllMMIIlllMMIMIIMMMMM 

Db 1561 ciiiiiiAiG/J^^ 

1621 GAACTTGCACTTGTGAAGCCACGACAGAGCATGACCCTCAGCTCAACTTTCACCAATAAA 1680 

Qy 1621 ^^^^^^y^^ 

Qy 1681 GAGGCCTTCCTTGATGTTGATTCCAGTCCAGAAGGGTCTGGGACTGAAGATAATTTACAG 1740 

IMIIIIMIIIIMMMMIIMIIIMIMIIIIIIMMIIIIIIIIIIIII^^^ 

1681 iAGGCCTTCCTTGATGTTGATTCCAGTCCAGAAGGGTCTGGGACTG^^^ 1740 



Db 

Qy 



1741 TGA 1743 
I I I 

Db 1741 TGA 1743 



RESULT 2 

US-08-232-463-14 

Sequence 14, Application US/08232463 
Patent No. 5670367 
GENERAL INFORMATION: 

APPLICANT: DORNER, F. 
APPLICANT: SCHEIFLINGER, F. 
APPLICANT: FALKNER, F. G. 

TITLE OF INVENTION: RECOMBINANT FOWLPOX VIRUS 
NUMBER OF SEQUENCES: 52 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Foley & Lardner 
STREET: 1800 Diagonal Road, Suite 500 
CITY: Alexandria 
STATE: VA 
COUNTRY: USA 
ZIP: 22313-0299 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: PatentIn Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/232,463 
FILING DATE: 
CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US/07/935,313 
FILING DATE: 

APPLICATION NUMBER: EP 91 114 300.6 
FILING DATE: 26-AUG-1991 
ATTORNEY/AGENT INFORMATION: 
NAME: BENT, Stephen A. 
REGISTRATION NUMBER: 29,768 
REFERENCE/DOCKET NUMBER: 30472/114 IMMU 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (703)836-9300 
TELEFAX: (703)683-4109 
TELEX: 899149 
INFORMATION FOR SEQ ID NO: 14: 



SEQUENCE CHARACTERISTICS: 
LENGTH: 7218 base pairs 
TYPE: nucleic acid 
STRANDEDNESS: single 
TOPOLOGY: linear 
IMMEDIATE SOURCE: 
CLONE: pTZgpt-Fls 
US-08-232-463-14 

Query Match 2.7%; Score 47.6; DB 1; Length 7218; 

Best Local Similarity 6.0%; Pred. No. 0.00054; 

Matches 23; Conservative 200; Mismatches 159; Indels 0; Gaps 0; 
Qy 692 CTGTTGACTCATCTGAAGTCTACTCTTGGCTTGATAGTTTTCTGTTGTTGATGCTGGGTG 751 

Db 1088 ^YYY^YYYYYYY^YYYYYYYYYYY^^^ 1147 

Qy 752 GAATCCCATGGCAAGCATACTTTCAGAGGGTTCTCTCTTCTTCCTCAGCCACCTATGCTC 811 

Db 1148 YYYYYiYYYYYiYYY^YYYYYYYYYYYYYYY^ 1207 

Qy 812 AAGTGCTGTCCTTCCTGGCAGCTTTCGGGTGCCTGGTGATGGCCATCCCAGCCATACTCA 871 

Db 1208 YYYYYYYY^YYYYYiYYYYYYYYYY^^^^ 1267 

Qy 872 TTGGGGCCATTGGAGCATCAACAGACTGGAACCAGACTGCATATGGGCTTCCAGATCCCA 931 

Db 1268 ^^YYYYYYY^YYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYYY 1327 

Qy 932 AGACTACAGAAGAGGCAGACATGATTTTACCAATTGTTCTGCAGTATCTCTGCCCTGTGT 991 

Db 1328 YYYYYYYYYYYYYYYYYYYYYYYY^YYY^YYY^YYYYYYYYYYY^^^^ 1387 

Qy 992 ATATTTCTTTCTTTGGTCTTGGTGCAGTTTCTGCTGCTGTTATGTCATCAGCAGATTCTT 1051 

Db 1388 YYYYYYYYYYYY^^YYYYYYYYYYYYYY^^ 1447 

Qy 1052 CCATCTTGTCAGCAAGTTCCAT 1073 

I Mil I I I I M III 
Db 1448 CTATCTCTTTAACTACTTGCAT 1469 

RESULT 3 

US-09-640-198D-3 

Sequence 3, Application US/09640198D 
Patent No. 6586411 
GENERAL INFORMATION: 
APPLICANT: Russell, Stephen 
APPLICANT: Kay Whye, Peng 

TITLE OF INVENTION: System for Monitoring the Location of 
TITLE OF INVENTION: Transgenes 
FILE REFERENCE: 07039-295001 
CURRENT APPLICATION NUMBER: US/09/640,198D 
CURRENT FILING DATE: 2000-08-16 
PRIOR APPLICATION NUMBER: US 60/149,168 
PRIOR FILING DATE: 1999-08-17 
NUMBER OF SEQ ID NOS: 34 



SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 3 
LENGTH: 1857 
TYPE: DNA 

ORGANISM: Rattus sp. 
US-09-640-198D-3 



Query Match 2.4%; Score 41; DB 4; Length 1857; 

Qy 404 TGATGGGAGAAATGTTCTGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCG 463 

II,,, I I I I I II I I II M I M I I I I 

Db 413 TGGTGGCCACGATGCTGTATACAGGCATCGTGATCTACGCGCCTGCGCTCATCCTGAACC 472 

Qy 464 TGATCATCGATGTGGATATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGT 523 

1,1, I I II I I M 11 I I I I I I I I I III III 

Db 473 AAGTGACCGGGTTGGACATCTGGGCATCGCTCCTGTCCACAGGAATCATCTGCACCTTGT 532 

Qy 524 ACACACTGGTGGGAGGGCTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCA 583 

I II I I I M I M 1 I I I M I I I 1 I I 11 I I Mill I I 

Db 533 ACACTACCGTGGGTGGTATGAAGGCCGTGGTCTGGACAGATGTGTTCCAGGTTGTGGTAA 592 

Qy 584 

I I I I I I I I I I I I I I II 

Db 593 



RESULT 4 

US-08-595-553A-1 

Sequence 1, Application US/08595553A 
Patent No. 6391579 
GENERAL INFORMATION: 

APPLICANT: NANCY CARRASCO, ET AL. 

TITLE OF INVENTION: THYROID SODIUM/IODIDE SYMPORTER AND 
TITLE OF INVENTION: NUCLEIC ACID ENCODING SAME 
NUMBER OF SEQUENCES: 2 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: AMSTER, ROTHSTEIN & EBENSTEIN 
STREET: 90 PARK AVENUE 
CITY: NEW YORK 
STATE: NEW YORK 
COUNTRY: U.S.A. 
ZIP: 10016 
COMPUTER READABLE FORM: 

MEDIUM TYPE: 3.5 INCH 1.44 Mb STORAGE 
MEDIUM TYPE: DISKETTE 
COMPUTER: IBM PC COMPATIBLE 
OPERATING SYSTEM: MS-DOS 
SOFTWARE: ASCII 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/595,553A 
FILING DATE: FEBRUARY 1, 1996 
ATTORNEY/AGENT INFORMATION: 
NAME: CRAIG J. ARNOLD 
REGISTRATION NUMBER: 34,287 
REFERENCE/DOCKET NUMBER: 96700/393 



TELECOMMUNICATION INFORMATION: 
TELEPHONE: (212) 697-5995 
TELEFAX: (212) 286-0854 or 286-0082 
TELEX: TWX 710-581-4766 
INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 2839 
TYPE: NUCLEIC ACID 
STRANDEDNESS: DOUBLE 
TOPOLOGY: LINEAR 
MOLECULE TYPE: 

DESCRIPTION: OLIGONUCLEOTIDE 
HYPOTHETICAL: NO 
ANTI-SENSE: NO 
ORIGINAL SOURCE: 
ORGANISM: RAT 

INDIVIDUAL ISOLATE: SODIUM/IODIDE SYMPORTER 
US-08-595-553A-1 

Query Match 2.4%; Score 41; DB 4; Length 2839; 

Best Local Similarity 49.8%; Pred. No. 0.035; 

Matches 104; Conservative 0; Mismatches 105; Indels 0; Gaps 0; 
Qy 404 TGATGGGAGAAATGTTCTGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCG 463 

Mill I I I I I II I III I I I I I I I I I 

Db 522 TGGTGGCCACGATGCTGTATACAGGCATCGTGATCTACGCGCCTGCGCTCATCCTGAACC 581 

Qy 464 TGATCATCGATGTGGATATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGT 523 

^ I I 11 II II I 1 II I I I 1 1 I I 1 I I I I II 1 t I 

Db 582 AAGTGACCGGGTTGGACATCTGGGCATCGCTCCTGTCCACAGGAATCATCTGCACCTTGT 641 

Qy 524 ACACACTGGTGGGAGGGCTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCA 583 

^ III! Mill II I I I MM II II Mill I Ml I I I 

Db 642 ACACTACCGTGGGTGGTATGAAGGCCGTGGTCTGGACAGATGTGTTCCAGGTTGTGGTAA 701 



Qy 584 TTTTTGTAGGGCTGTGGATCAGCGTCCCC 612 

I I II M I Ml I 1 I I II 
Db 702 TGCTCGTTGGCTTCTGGGTGATCCTGGCC 730 



RESULT 5 

US-09-621-976-18033 

Sequence 18033, Application US/09621976 
Patent No. 6639063 
GENERAL INFORMATION: 
APPLICANT: Dumas Milne Edwards, J.B. 
APPLICANT: Jobert, S. 
APPLICANT: Giordano, J.Y. 

TITLE OF INVENTION: ESTs and Encoded Human Proteins. 
FILE REFERENCE: GENSET.054PR2 
CURRENT APPLICATION NUMBER: US/09/621,976 
CURRENT FILING DATE: 2000-07-21 
NUMBER OF SEQ ID NOS: 19335 
SOFTWARE: Patent.pm 
SEQ ID NO 18033 
LENGTH: 474 
TYPE: DNA 



ORGANISM: Homo sapiens 
FEATURE: 

NAME/KEY: misc_feature 
LOCATION: 16 

OTHER INFORMATION: n=a, g, c or t 
US-09-621-976-18033 



Query Match 2.3%; Score 39.6; DB 4; Length 474; 

Best Local Similarity 13.4%; Pred. No. ^.03; 

Matches 42; Conservative 134; Mismatches 138; Indels 0; Gaps 0; 

CTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTTTCTGCTGCTGTTATGTCATC 1040 
: : I : : : I I I I : : : : 1 M M : : : = I : M : I = : = . . 



Qy 


981 


Db 


54 


Qy 


1041 


Db 


114 


Qy 


1101 


Db 


174 


Qy 


1161 


Db 


234 


Qy 


1221 


Db 


294 


Qy 


1281 


Db 


354 



AGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCACGGAACATCTACCAGCTTTC 
r ! I : : : : : : : : : : : : I : 



1100 



CTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTTATGCGAATCACAGTGTTTGT 1160 
- - I I : 1 ::::::::: I I : I : : I I : I I = • 



I : I : : : | : : : : : | ::::::::::::: I = I 



CCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAGCTGCTTTGTGTACTCTTTGT 128 

. I : I : : | : : : : I : I : : : : : : : I : :::::::: 
294 ARKTTYYTWAWYYTTYYKRMCCYYMRKTTYCMMMWYSRWWRGSMWTARGAWW^ 353 



: I 



RESULT 6 

US-09-328-352-2245 

Sequence 2245, Application US/09328352 
Patent No. 6562958 
GENERAL INFORMATION: 
APPLICANT: Gary L. Breton et al.
TITLE OF INVENTION: NUCLEIC ACID AND AMINO ACID SEQUENCES RELATING TO ACINETOBACTER
TITLE OF INVENTION: BAUMANNII FOR DIAGNOSTICS AND THERAPEUTICS
FILE REFERENCE: GTC99-03PA 

CURRENT APPLICATION NUMBER: US/09/328,352
CURRENT FILING DATE: 1999-06-04 
NUMBER OF SEQ ID NOS: 8252
SEQ ID NO 2245 
LENGTH: 1506 
TYPE: DNA 

ORGANISM: Acinetobacter baumannii 
US-09-328-352-2245 

Query Match 2.2%; Score 39.2; DB 4; Length 1506; 

Best Local Similarity 51.1%; Pred. No. 0.085; 



Matches 92; Conservative 0; Mismatches 88; Indels 0; Gaps 0; 
Qy 1008 TCTTGGTGCAGTTTCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAG 1067

I I I I I I I I I I I I I I I I I I I I I 111 II Mill 

Db 1005 TckGCTGCTATTTTAGCTGCGGTTATGAGTACATTAAGCTGTCAGCTTTTGGTATGTTC 1064 

Qy 1068 TTCCATGTTTGCACGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGA 1127

II I I I I I I I I 1 I M 1 M 11 I 1 I M M I 

Db 1065 AAGTGCACTAACTGAAGATTTGTACAAAGGCTTCATTCGTAAAAATGCATCTCAAAAAGA 1124 

Qy 1128 AATCGTTTGGGTTATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGC 1187

^ I I I I I I I I I 11 I I I I I I 1 I I I 1 IN I I'll I III 

Db 1125 GCTTGTATGGGTTGGGCGTATCATGGTGCTTGCAATTGCCGTTCTAGCAATTGTGCTTGC 1184 

RESULT 7 

US-09-328-352-3451 

Sequence 3451, Application US/09328352 
Patent No. 6562958 
GENERAL INFORMATION: 

APPLICANT: Gary L. Breton et al.
TITLE OF INVENTION: NUCLEIC ACID AND AMINO ACID SEQUENCES RELATING TO ACINETOBACTER
TITLE OF INVENTION: BAUMANNII FOR DIAGNOSTICS AND THERAPEUTICS
FILE REFERENCE: GTC99-03PA 

CURRENT APPLICATION NUMBER: US/09/328,352 
CURRENT FILING DATE: 1999-06-04 
NUMBER OF SEQ ID NOS: 8252
SEQ ID NO 3451 
LENGTH: 558 
TYPE: DNA 

ORGANISM: Acinetobacter baumannii
US-09-328-352-3451

Query Match 2.2%; Score 39; DB 4; Length 558; 

Best Local Similarity 50.3%; Pred. No. 0.051; 

Matches 96; Conservative 0; Mismatches 95; Indels 0; Gaps 0;
Qy 1455 TACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTATCTATTTGAAAGTGG 1514

I I I I I I II III I I I 1 I I I I 1 1 II 1 IN Ml 1 

Db 341 TAAATCAAAATGATGCAAATGCTTCATGGCTGATGTTGCAAACTTCAACTTTTCAAGATG 400 

Qy 1515 AACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGACACAGTGAAGAAAA 1574

I I I II II III I 1 1 I I II 111 11 1 II ' " 

Db 401 GCCGTAGTCATCTGAATGCGGCAAAGCTCAAGGTGAAGTTTCAGAAGCAAGCAGATGGAA 460 

Qy 1575 CATGGATAAGACAATTCTTGTCAAAAATGAAAATATTAAATTAGATGAACTTGCACTTGT 1634

I I II II I III 11 M I I I I'll'',, 

Db 461 CATGGAAAATTAAACATTTCCAAACACAGAATATTTTCAGTCGTCCGGTATCGCATTGGC 520 

Qy 1635 GAAGCCACGAC 1645

111 1 1 I 
Db 521 AAAGTGAAGCC 531 



RESULT 8 

US-10-162-012-28 



Sequence 28, Application US/10162012 
Patent No. 6682597 
GENERAL INFORMATION: 

APPLICANT: Curtis, Rory A.J. 

APPLICANT: Silos-Santiago, Inmaculada

TITLE OF INVENTION: NOVEL HUMAN ION CHANNEL AND TRANSPORTER FAMILY MEMBERS
FILE REFERENCE: 10448-190001
CURRENT APPLICATION NUMBER: US/10/162,012
CURRENT FILING DATE: 2002-06-04 
PRIOR APPLICATION NUMBER: US 60/209,845 
PRIOR FILING DATE: 2000-06-06 
PRIOR APPLICATION NUMBER: US 09/875,321 
PRIOR FILING DATE: 2001-06-06 
PRIOR APPLICATION NUMBER: PCT/US01/18340
PRIOR FILING DATE: 2001-06-06 
PRIOR APPLICATION NUMBER: US 60/209,257 
PRIOR FILING DATE: 2000-06-05 
PRIOR APPLICATION NUMBER: US 09/875,423 
PRIOR FILING DATE: 2001-06-05 
PRIOR APPLICATION NUMBER: PCT/US01/18398
PRIOR FILING DATE: 2001-06-05 
PRIOR APPLICATION NUMBER: US 60/209,238 
PRIOR FILING DATE: 2000-06-05 
PRIOR APPLICATION NUMBER: US 09/875,363 
PRIOR FILING DATE: 2001-06-05 
PRIOR APPLICATION NUMBER: PCT/US01/18247
PRIOR FILING DATE: 2001-06-05 
PRIOR APPLICATION NUMBER: US 60/227,068 
PRIOR FILING DATE: 2000-08-22 
PRIOR APPLICATION NUMBER: US 09/928,530 
PRIOR FILING DATE: 2001-08-13 
PRIOR APPLICATION NUMBER: PCT/US01/25475
PRIOR FILING DATE: 2001-08-15 
PRIOR APPLICATION NUMBER: US 60/226,770 
PRIOR FILING DATE: 2000-08-21 
PRIOR APPLICATION NUMBER: US 09/934,421 
PRIOR FILING DATE: 2001-08-21 
PRIOR APPLICATION NUMBER: PCT/US01/26096
PRIOR FILING DATE: 2001-08-21 
PRIOR APPLICATION NUMBER: US 60/279,281 
PRIOR FILING DATE: 2001-03-28 
PRIOR APPLICATION NUMBER: US 10/109,029 
PRIOR FILING DATE: 2002-03-28 
PRIOR APPLICATION NUMBER: PCT/US02/09728 
PRIOR FILING DATE: 2002-03-28 
PRIOR APPLICATION NUMBER: US 60/290,288 
PRIOR FILING DATE: 2001-05-11 
PRIOR APPLICATION NUMBER: US (not assigned) 
PRIOR FILING DATE: 2002-05-13 
NUMBER OF SEQ ID NOS: 48

SOFTWARE: FastSEQ for Windows Version 4.0
SEQ ID NO 28 
LENGTH: 2028 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-10-162-012-28 



Query Match 2.2%; Score 38.2; DB 4; Length 2028; 

Best Local Similarity 46.1%; Pred. No. 0.22; 

Matches 239; Conservative 0; Mismatches 273; Indels 7; Gaps 3;



Qy 79 TGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAAGCCATCATAGTTGGTGGC 138

Mil I I I MM 111 1 Ml 111 II III i

Db 126 TGGACTATGGTCCACAGTGAAGACCAAAAGAGACACAGTGAAAGGCTACTTCCTGGCTGA 185

Qy 139 CGAGATATTGGTTTATTGGTTGGTGGATTTA-CCATGACAGCTACCTGGGTCGGAGGAGG 197

III M I I I I I 11 I I I 1 I I Ill

Db 186 AGGGAACATGGTGTGGTGGCCAGTGGGTGCATCCTTGTTTGCCAGCAATGTTGGAAGTGG 245

Qy 198 GTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTATGGCCTAGCTTGGGCTCA 257

M III MM II I Ml I M III I II I I'l I 
Db 246 ACATTTCATTGGCCTGGCAGGGTCAGGTGCTGCTACGGGCATTTCTGTA---TCAGCTTA 302

Qy 258 GGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTGTTCTTTGCAAAACCTAT 317

I II I I II I I I I I M I I I I I I II N 

Db 303 TGAACTTAATGGCTTGTTTTCTGTGCTGATGTTGGCCTGGATCTTCCTACCCATCTACAT 362 

Qy 318 GCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAAATCTATGGAAAACGCAT 377

I Ml M I III ill II M 

Db 363 TGCTGGTCAGGTCACCACGATGCCAGAATACCTACGGAAGCGCTTCGGTGGCATCAGAAT 422 

Qy 378 GGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTCTGGGCTGCAGCAATTTT 437

I Mill I I I II I I M III II 

Db 423 CCCCATCATCCTGGCTGTACTCTACCTATTTATCTACATCTTCACCAAGATCTCGGTAGA 482 

Qy 438 CTCTGCTTTGGGAGCCACCATCAGCGTGATCATCG---ATGTGGATATGCACATTTCTGT 494

I I II MM I Ml I I I Mill II M I I I 

Db 483 CATGTATGCAGGTGCCATCTTCATCCAGCAGTCTTCGCACCTGGATCTGTACCTGGCCAT 542 

Qy 495 CATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGGCTCTATTCTGTGGC 554

^ 1 II I I I I I I I I II I I M II M I MM I 



Db 543 AGTTGGGCTACTGGCCATCACTGCTGTATACACGGTTGCTGGTGGCCTGGCTGCTGTGAT 602



Qy 555 CTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGG 593 

Mllll MM I I Ml I Mi II MM 

Db 603 CTACACGGATGCCCTGCAGACGCTGATCATGCTTATAGG 641 



RESULT 9 

US-10-162-012-26 

; Sequence 26, Application US/10162012 

; Patent No. 6682597 

; GENERAL INFORMATION: 

; APPLICANT: Curtis, Rory A.J. 

; APPLICANT: Silos-Santiago, Inmaculada

; APPLICANT: Gu, Wei

; TITLE OF INVENTION: NOVEL HUMAN ION CHANNEL AND TRANSPORTER FAMILY MEMBERS 

; FILE REFERENCE: 10448-190001 

; CURRENT APPLICATION NUMBER: US/10/162,012

; CURRENT FILING DATE: 2002-06-04 

; PRIOR APPLICATION NUMBER: US 60/209,845 

; PRIOR FILING DATE: 2000-06-06 

; PRIOR APPLICATION NUMBER: US 09/875,321 



PRIOR FILING DATE: 2001-06-06 

PRIOR APPLICATION NUMBER: PCT/US01/18340

PRIOR FILING DATE: 2001-06-06 

PRIOR APPLICATION NUMBER: US 60/209,257 

PRIOR FILING DATE: 2000-06-05 

PRIOR APPLICATION NUMBER: US 09/875,423 

PRIOR FILING DATE: 2001-06-05 

PRIOR APPLICATION NUMBER: PCT/US01/18398
; PRIOR FILING DATE: 2001-06-05
; PRIOR APPLICATION NUMBER: US 60/209,238
; PRIOR FILING DATE: 2000-06-05
; PRIOR APPLICATION NUMBER: US 09/875,363
; PRIOR FILING DATE: 2001-06-05
; PRIOR APPLICATION NUMBER: PCT/US01/18247
; PRIOR FILING DATE: 2001-06-05 
; PRIOR APPLICATION NUMBER: US 60/227,068 
; PRIOR FILING DATE: 2000-08-22 
; PRIOR APPLICATION NUMBER: US 09/928,530 
; PRIOR FILING DATE: 2001-08-13 
; PRIOR APPLICATION NUMBER: PCT/US01/25475
; PRIOR FILING DATE: 2001-08-15 
; PRIOR APPLICATION NUMBER: US 60/226,770 
; PRIOR FILING DATE: 2000-08-21 
; PRIOR APPLICATION NUMBER: US 09/934,421 
; PRIOR FILING DATE: 2001-08-21 
; PRIOR APPLICATION NUMBER: PCT/US01/26096
; PRIOR FILING DATE: 2001-08-21 
; PRIOR APPLICATION NUMBER: US 60/279,281 
; PRIOR FILING DATE: 2001-03-28 
; PRIOR APPLICATION NUMBER: US 10/109,029 
; PRIOR FILING DATE: 2002-03-28 
; PRIOR APPLICATION NUMBER: PCT/US02/09728 
; PRIOR FILING DATE: 2002-03-28 
; PRIOR APPLICATION NUMBER: US 60/290,288 
; PRIOR FILING DATE: 2001-05-11 

PRIOR APPLICATION NUMBER: US (not assigned) 
; PRIOR FILING DATE: 2002-05-13 
; NUMBER OF SEQ ID NOS: 48

SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 26 

LENGTH: 2326 
; TYPE: DNA
; ORGANISM: Homo sapiens
; FEATURE:

NAME/KEY: CDS
; LOCATION: (178)...(2202)
US-10-162-012-26 

Query Match 2.2%; Score 38.2; DB 4; Length 2326; 

Best Local Similarity 46.1%; Pred. No. 0.24; 

Matches 239; Conservative 0; Mismatches 273; Indels 7; Gaps 3;
Qy 79 TGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAAGCCATCATAGTTGGTGGC 138

MM I INI II III I III III II III I 

Db 303 TGGACTATGGTCCACAGTGAAGACCAAAAGAGACACAGTGAAAGGCTACTTCCTGGCTGA 362



Qy 139 CGAGATATTGGTTTATTGGTTGGTGGATTTA-CCATGACAGCTACCTGGGTCGGAGGAGG 197



Ill 1 1 11 I III 1 1 1 1 I I 1 1 1 1 1 1 I I III"''',, 

Db 363 AGGGAACATGGTGTGGTGGCCAGTGGGTGCATCCTTGTTTGCCAGCAATGTTGGAAGTGG 422 

Qy 198 GTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTATGGCCTAGCTTGGGCTCA 257

I I I II I I II III I I I I II III I II I II I I 
Db 423 ACATTTCATTGGCCTGGCAGGGTCAGGTGCTGCTACGGGCATTTCTGTA---TCAGCTTA 479



Qy 258 GGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTGTTCTTTGCAAAACCTAT 317

I II I III I I I I IIMI II I I I IN I I I II 
Db 480 TGAACTTAATGGCTTGTTTTCTGTGCTGATGTTGGCCTGGATCTTCCTACCCATCTACAT 539 

Qy 318 GCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAAATCTATGGAAAACGCAT 377

1 III II II I ill I I II 

Db 540 TGCTGGTCAGGTCACCACGATGCCAGAATACCTACGGAAGCGCTTCGGTGGCATCAGAAT 599 



Qy 378 GGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTCTGGGCTGCAGCAATTTT 437

^ 1 Mill I I I II I I II III II

Db 600 CCCCATCATCCTGGCTGTACTCTACCTATTTATCTACATCTTCACCAAGATCTCGGTAGA 659

Qy 438 CTCTGCTTTGGGAGCCACCATCAGCGTGATCATCG---ATGTGGATATGCACATTTCTGT 494

, Ill I i II I I I I I I I I '

Db 660 CATGTATGCAGGTGCCATCTTCATCCAGCAGTCTTCGCACCTGGATCTGTACCTGGCCAT 719

Qy 495 CATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGGCTCTATTCTGTGGC 554

I M I I M I I I I I I I I M I I II I I I I I I

Db 720 AGTTGGGCTACTGGCCATCACTGCTGTATACACGGTTGCTGGTGGCCTGGCTGCTGTGAT 779

Qy 555 CTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGG 593

null INI I I ill I III II III"

Db 780 CTACACGGATGCCCTGCAGACGCTGATCATGCTTATAGG 818



RESULT 10 
US-09-557-884-1/C 

; Sequence 1, Application US/09557884 

; Patent No. 6506581 

; GENERAL INFORMATION: 

APPLICANT: Fleischmann et al.

TITLE OF INVENTION: The Nucleotide sequence of 

the Haemophilus influenzae Rd Genome, Fragments 
; Thereof, and Uses Thereof

; NUMBER OF SEQUENCES: 1

; CORRESPONDENCE ADDRESS: 

ADDRESSEE: Human Genome Sciences, Inc. 
; STREET: 9410 Key West Avenue

; CITY: Rockville

STATE: MD 
; COUNTRY: USA 

ZIP: 20850 
; COMPUTER READABLE FORM: 

; MEDIUM TYPE: 3 1/2 inch diskette 

; COMPUTER: Dell Pentium 

; OPERATING SYSTEM: MS DOS v6.22 

; SOFTWARE: ASCII Text 

; CURRENT APPLICATION DATA: 

; APPLICATION NUMBER: US/09/557,884

FILING DATE: 25-Apr-2000 



CLASSIFICATION: <Unknown>
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/476,102 
FILING DATE: JUN-5-1995 
ATTORNEY/AGENT INFORMATION: 
NAME: Michelle S. Marks 
REGISTRATION NUMBER: 41,971 
REFERENCE/DOCKET NUMBER: PB186P3
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 301-309-8504 
TELEFAX: 301-309-8439 
INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 1830121 base pairs 
TYPE: nucleic acid 
STRANDEDNESS: double
TOPOLOGY: linear 
SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
US-09-557-884-1 

Query Match 2.2%; Score 38.2; DB 4; Length 1830121; 

Best Local Similarity 48.7%; Pred. No. 19; 

Matches 132; Conservative 0; Mismatches 138; Indels 1; Gaps 1;
Qy 1009 CTTGGTGCAGTTTCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGT 1068

111 1! II I I I I I M I I I I I I I I III I I 

Db 1428785 CTTTCCGCTATTTTAGCAGCAGTAATGAGTACATTAAGTGCGCAATTGTTAATTTCCTCT 1428726

Qy 1069 TCCATGTTTGCACGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAA 1128

I III I II I I I II I I t II II II 1 I II I II 

Db 1428725 AGCTCAATCACAGAAGATTTCTATAAAGGTTTTATTCGCCCTAACGCATCTGAAAAAGAG 1428666

Qy 1129 ATCGTTTGGGTTATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGC- 1187

1 1 1 1 II I II 1 1 II I 1 1 1 1 1 1 1 III III 1 1 1 I 1 1 1 1 

Db 1428665 CTCGTATGGCTTGGCAGAATTATGGTGTTAGTTATTGCCGCACTTGCTATCTGGATCGCA 1428606

Qy 1188 CTTGCTGACGAAAACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGT 1247

^ I I I II I I I Mil MM! M I I I I I I I 

Db 1428605 CAAGATGAAAACAGCAAAGTATTAAAACTTGTAGAATTTGCTTGGGCGGGGTTTGGTAGT 1428546

Qy 1248 TATCTTCCCCCAGCTGCTTTGTGTACTCTTT 1278

M I I I I II II MINI 

Db 1428545 GCATTTGGCCCTGTTGTACTTTTCTCTCTTT 1428515 



RESULT 11 

US-09-643-990A-1/C 

; Sequence 1, Application US/09643990A 

; Patent No. 6528289 

; GENERAL INFORMATION: 

; APPLICANT: Robert D. Fleischmann 

Mark D. Adams 
Owen White 



Hamilton O. Smith 
J. Craig Venter 
TITLE OF INVENTION: The Nucleotide sequence of 

the Haemophilus influenzae Rd Genome, Fragments 
Thereof, and Uses Thereof 
NUMBER OF SEQUENCES: 1 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: Human Genome Sciences, Inc. 
STREET: 9410 Key West Avenue 
CITY: Rockville, 
STATE: MD 
COUNTRY: USA 
ZIP: 20850 
COMPUTER READABLE FORM: 

MEDIUM TYPE: 3 1/2 inch diskette 
COMPUTER: Dell Pentium 
OPERATING SYSTEM: MS DOS v6.22 
SOFTWARE: ASCII Text 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/643,990A
FILING DATE: 23-Aug-2000
CLASSIFICATION: <Unknown>
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/487,429 
FILING DATE: 1995-06-07 
APPLICATION NUMBER: 08/426,787 
FILING DATE: 1995-04-21 
ATTORNEY/AGENT INFORMATION: 
NAME: Kenley K. Hoover 
REGISTRATION NUMBER: 40,302 
REFERENCE/DOCKET NUMBER: PB186P1C1
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 301-610-5790 
TELEFAX: 310-309-8439 
INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 1830121 base pairs 
TYPE: nucleic acid 
STRANDEDNESS: double 
TOPOLOGY: linear 
SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
US-09-643-990A-1 

Query Match 2.2%; Score 38.2; DB 4; Length 1830121; 

Best Local Similarity 48.7%; Pred. No. 19;

Matches 132; Conservative 0; Mismatches 138; Indels 1; Gaps 1;

Qy 1009 CTTGGTGCAGTTTCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGT 1068

111 II I I I I I 1 I M I I I II I I I Ml I ' 

Db 1428785 CTTTCCGCTATTTTAGCAGCAGTAATGAGTACATTAAGTGCGCAATTGTTAATTTCCTCT 1428726

Qy 1069 TCCATGTTTGCACGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAA 1128

I 111 I INI I II I I I II II II II I IN I 

Db 1428725 AGCTCAATCACAGAAGATTTCTATAAAGGTTTTATTCGCCCTAACGCATCTGAAAAAGAG 1428666

Qy 1129 ATCGTTTGGGTTATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGC- 1187

I M I I M I I Mill I I I I I II I II Ml MM Mil 

Db 1428665 CTCGTATGGCTTGGCAGAATTATGGTGTTAGTTATTGCCGCACTTGCTATCTGGATCGCA 1428606

Qy 1188 CTTGCTGACGAAAACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGT 1247



I I I I I I I II I I MM! II I I M III 

Db 1428605 CAAGATGAAAACAGCAAAGTATTAAAACTTGTAGAATTTGCTTGGGCGGGGTTTGGTAGT 1428546

Qy 1248 TATCTTCCCCCAGCTGCTTTGTGTACTCTTT 1278 

II Ml Ml II MMM 

Db 1428545 GCATTTGGCCCTGTTGTACTTTTCTCTCTTT 1428515 



RESULT 12 
US-09-640-198D-1 

Sequence 1, Application US/09640198D 
Patent No. 6586411 
GENERAL INFORMATION: 
APPLICANT: Russell, Stephen 
APPLICANT: Kay Whye, Peng 

TITLE OF INVENTION: System for Monitoring the Location of 
TITLE OF INVENTION: Transgenes 

FILE REFERENCE: 07039-295001 

CURRENT APPLICATION NUMBER: US/09/640,198D
CURRENT FILING DATE: 2000-08-16 
PRIOR APPLICATION NUMBER: US 60/149,168 
PRIOR FILING DATE: 1999-08-17 
NUMBER OF SEQ ID NOS: 34

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 1 
LENGTH: 1932 
TYPE: DNA 

ORGANISM: Homo Sapiens 
US-09-640-198D-1 

Query Match 2.2%; Score 38; DB 4; Length 1932; 

Best Local Similarity 51.8%; Pred. No. 0.24; 

Matches 86; Conservative 0; Mismatches 80; Indels 0; Gaps 0; 
Qy 437 TCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGATATGCACATTTCTGTCA 496

III II II MMM MM MMM " ' ' 

Db 446 TCTACGCACCGGCCCTCATCCTGAACCAAGTGACCGGGCTGGACATCTGGGCGTCGCTCC 505 

Qy 497 TCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGGCTCTATTCTGTGGCCT 556

I I I I I I II III MMM II Mill I I MMM M 
Db 506 TGTCCACCGGAATTATCTGCACCTTCTACACGGCTGTGGGCGGCATGAAGGCTGTGGTCT 565 

Qy 557 ACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGGAT 602

IMIIMI I Ml II I Ml Ml Ml I 

Db 566 GGACTGATGTGTTCCAGGTCGTGGTGATGCTAAGTGGCTTCTGGGT 611 



RESULT 13 
US-09-601-198-165 

; Sequence 165, Application US/09601198 



Patent No. 6531583 
GENERAL INFORMATION: 

APPLICANT: Cassell, Gail H. 

APPLICANT: Chen, Ellson Y. 

APPLICANT: Glass, Jennifer S. 

APPLICANT: Glass, John I. 

APPLICANT: Heiner, Cheryl R. 

TITLE OF INVENTION: NUCLEIC ACID PROBES AND METHOD FOR DETECTING UREAPLASMA

TITLE OF INVENTION: UREALYTICUM
FILE REFERENCE: UAB-13452/22
CURRENT APPLICATION NUMBER: US/09/601,198
CURRENT FILING DATE: 2000-12-08 
PRIOR APPLICATION NUMBER: 60/073,189 
PRIOR FILING DATE: 1998-01-30 
NUMBER OF SEQ ID NOS: 181
SOFTWARE: PatentIn Ver. 2.0 
SEQ ID NO 165 

LENGTH: 4344 

TYPE: DNA 

ORGANISM: Ureaplasma urealyticum 
US-09-601-198-165 

Query Match 2.1%; Score 37.4; DB 4; Length 4344;

Best Local Similarity 48.0%; Pred. No. 0.64; 

Matches 107; Conservative 0; Mismatches 116; Indels 0; Gaps 0;

Qy 1401 TGATGATAATGGTATATATAATCAGAAATTTCCATTTAAAACACTTGCCATGGTTACATC 1460

I I I I I I I I I I I I I I I I M M I I III

Db 864 TTGATCAATCAGTAGATTTTTTAAAAGTAAATATTGAAGCATTAATTAATCATCAACC 923

Qy 1461 ATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTATCTATTTGAAAGTGGAACCTT 1520

II M M I M I I III I I I I I I I I I I I I I I

Db 924 ACTTAAAAACACAACATGAAACGATTTTATTAACAAAAATGTTACAGATATTAGTGCTTT 983

Qy 1521 GCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGACACAGTGAAGAAAACATGGA 1580

M I I M I I t I I I II II t I I 1 I II I WW

Db 984 AAGTAACTTATTAGAAATTTTTGAAACTAATGAAATTACAAATAATGAATGAAACCAATT 1043

Qy 1581 TAAGACAATTCTTGTCAAAAATGAAAATATTAAATTAGATGAA 1623

I MINI I II I I I I I M I I I MM

Db 1044 AATTACGATTTTAATTAATCATGCACCTATTGATAAAATTGAA 1086



RESULT 14 
US-09-134-218-1/C 

Sequence 1, Application US/09134218A 
Patent No. 6312926 
GENERAL INFORMATION: 
APPLICANT: Shatkin, Aaron J. 
APPLICANT: Pillutla, Renuka 
APPLICANT: Reinberg, Danny 
APPLICANT: Yu, Zheng 
APPLICANT: Moldanado, Edio 

TITLE OF INVENTION: mRNA CAPPING ENZYMES AND USES THEREOF 
FILE REFERENCE: 601-1-079 ss 
CURRENT APPLICATION NUMBER: US/09/134,218A



CURRENT FILING DATE: 1998-08-14 
NUMBER OF SEQ ID NOS: 19
SOFTWARE: PatentIn Ver. 2.0
SEQ ID NO 1
LENGTH: 4160
TYPE: DNA

ORGANISM: Mus musculus
US-09-134-218-1 

Query Match 2.1%; Score 36.6; DB 4; Length 4160; 

Best Local Similarity 46.6%; Pred. No. 1.1;

Matches 117; Conservative 0; Mismatches 134; Indels 0; Gaps 0;

Qy 1417 TATAATCAGAAATTTCCATTTAAAACACTTGCCATGGTTACATCATTCTTAACCAACATT 1476

II I I i II MM III II III Ml Mill I I II I

Db 2323 TAAACTTAAAATCCAACATTTAAAAAACTCAATATGCTTACAGCTTCAGATTGCTAGTTA 2264

Qy 1477 TGCATCTCCTATCTAGCCAAGTATCTATTTGAAAGTGGAACCTTGCCACCTAAATTAGAT 1536

MM! II M I Ml I I M I II III II II

Db 2263 TGAATCAAATGTAAAGGTATCTATTCACATACAAACAGGCTCTATTTCATTAACTTCAAT 2204

Qy 1537 GTATTTGATGCTGTTGTTGCAAGACACAGTGAAGAAAACATGGATAAGACAATTCTTGTC 1596

I M I II I I I I I II I Ml M I I I I I II

Db 2203 CTGATTTAACCTTTGGGTATTTCAATCTGTAGACTCCACAGGGTAAGGCTGAATTTATTC 2144

Qy 1597 AAAAATGAAAATATTAAATTAGATGAACTTGCACTTGTGAAGCCACGACAGAGCATGACC 1656

I II 11 I I M I I I II Ml I I I II I Mil

Db 2143 AGGTATAAATAAAATATTTAGGTCCATGATGTACTGTAGTTCCAAGGAAACCAAATGTAC 2084

Qy 1657

I I

Db 2083




RESULT 15 

US-09-801-876B-3 

; Sequence 3, Application US/09801876B 

; Patent No. 6492155 

; GENERAL INFORMATION: 

; APPLICANT: YE, Jane et al 

; TITLE OF INVENTION: ISOLATED HUMAN KINASE PROTEINS, NUCLEIC

TITLE OF INVENTION: ACID MOLECULES ENCODING HUMAN KINASE PROTEINS, AND USES 
; TITLE OF INVENTION: THEREOF 
; FILE REFERENCE: CL001160 

; CURRENT APPLICATION NUMBER: US/09/801,876B
; CURRENT FILING DATE: 2001-03-09 
; NUMBER OF SEQ ID NOS: 8 

; SOFTWARE: FastSEQ for Windows Version 4.0 

; SEQ ID NO 3 

; LENGTH: 148567 

TYPE: DNA 

ORGANISM: Human 

FEATURE:
; NAME/KEY: misc_feature

LOCATION: (1)...(148567)
; OTHER INFORMATION: n = A,T,C or G 
US-09-801-876B-3 



Query Match 2.1%; Score 36.6; DB 4; Length 148567;

Best Local Similarity 46.1%; Pred. No. 12; 

Matches 123; Conservative 0; Mismatches 144; Indels 0; Gaps 0;
Qy 1416 ATATAATCAGAAATTTCCATTTAAAACACTTGCCATGGTTACATCATTCTTAACCAACAT 1475

I I I I I I M I I I I I I I M I I I I I I I I M MM 

Db 31673 AGATAATCAGTTGTTTTAACTTTTAATTTAAGCAGTAGCAGAATGACTTTTTGGGAACTT 31732 

Qy 1476 TTGCATCTCCTATCTAGCCAAGTATCTATTTGAAAGTGGAACCTTGCCACCTAAATTAGA 1535

I I I I II I I I I I I I I I II I I I I I II 

Db 31733 AGGAATTTGGAAACCTTTTTATTCTATGTATTGAATATCAACTATGTAATTTAGTCTAAG 31792 

Qy 1536 TGTATTTGATGCTGTTGTTGCAAGACACAGTGAAGAAAACATGGATAAGACAATTCTTGT 1595

I I I I I I I I I I I I I I M I 1 II I I I I I I I 1 

Db 31793 GTTATATGCTAGAAACATTTCAAAAACGAAAGCAGCAGCAATGACATCAAAAATGCATGT 31852 



Qy 1596 CAAAAATGAAAATATTAAATTAGATGAACTTGCACTTGTGAAGCCACGACAGAGCATGAC 1655

MM II II II nil III iL_JLL__-! 

Db 31853 CAAAAGCAAATGGTTTTAAATAGAAATACATCATTTTAACAATCTTGAAGTTTAAAAGAT 31912



Qy 1656 CCTCAGCTCAACTTTCACCAATAAAGA 1682 

II I II I I I I I I I I 

Db 31913 CCTATAAAAATCACAAACCCAGAAGGA 31939 



Search completed: September 29, 2004, 02:05:54 
Job time : 159 secs



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen 



OM nucleic - nucleic search, using sw model 

Run on: September 29, 2004, 00:10:08 ; Search time 1188 Seconds

(without alignments)

7431.150 Million cell updates/sec
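
The throughput figure in this header can be reproduced from the other header values. The sketch below assumes that a "cell update" is one query residue compared against one database residue, and that both strands of every database sequence are searched (a factor of 2, which is an inference from the hit count being twice the number of sequences, not something the report states). The function name is illustrative only.

```python
def cell_updates_millions(query_len, db_residues, seconds, strands=2):
    """Return millions of dynamic-programming cell updates per second.

    strands=2 is an assumption: the report lists 6674772 candidate hits
    for 3337386 sequences, which suggests both strands were searched.
    """
    return query_len * db_residues * strands / seconds / 1e6


# Values taken from this search header: a 1743-residue query,
# 2532474682 database residues, 1188 seconds of search time.
rate = cell_updates_millions(1743, 2532474682, 1188)
print(f"{rate:.3f} Million cell updates/sec")
```

Under those assumptions the result agrees with the figure printed in the header to three decimal places.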



Title: US-10-069-541-5
Perfect score: 1743
Sequence: 1 atggctttccatgtggaagg ctgaagataatttacagtga 1743

Scoring table: IDENTITY_NUC
Gapop 10.0 , Gapext 1.0

Searched: 3337386 seqs, 2532474682 residues

Total number of hits satisfying chosen parameters: 6674772

Minimum DB seq length: 0 
Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : Published Applications_NA: *

1: /cgn2_6/ptodata/1/pubpna/US07_PUBCOMB.seq:*
2: /cgn2_6/ptodata/1/pubpna/PCT_NEW_PUB.seq:*
3: /cgn2_6/ptodata/1/pubpna/US06_NEW_PUB.seq:*
4: /cgn2_6/ptodata/1/pubpna/US06_PUBCOMB.seq:*
5: /cgn2_6/ptodata/1/pubpna/US07_NEW_PUB.seq:*
6: /cgn2_6/ptodata/1/pubpna/PCTUS_PUBCOMB.seq:*
7: /cgn2_6/ptodata/1/pubpna/US08_NEW_PUB.seq:*
8: /cgn2_6/ptodata/1/pubpna/US08_PUBCOMB.seq:*
9: /cgn2_6/ptodata/1/pubpna/US09A_PUBCOMB.seq:*
10: /cgn2_6/ptodata/1/pubpna/US09B_PUBCOMB.seq:*
11: /cgn2_6/ptodata/1/pubpna/US09C_PUBCOMB.seq:*
12: /cgn2_6/ptodata/1/pubpna/US09_NEW_PUB.seq:*
13: /cgn2_6/ptodata/1/pubpna/US09_NEW_PUB.seq2:*
14: /cgn2_6/ptodata/1/pubpna/US10A_PUBCOMB.seq:*
15: /cgn2_6/ptodata/1/pubpna/US10B_PUBCOMB.seq:*
16: /cgn2_6/ptodata/1/pubpna/US10C_PUBCOMB.seq:*
17: /cgn2_6/ptodata/1/pubpna/US10_NEW_PUB.seq:*
18: /cgn2_6/ptodata/1/pubpna/US60_NEW_PUB.seq:*
19: /cgn2_6/ptodata/1/pubpna/US60_PUBCOMB.seq:*

Pred. No. is the number of results predicted by chance to have a
score greater than or equal to the score of the result being printed,
and is derived by analysis of the total score distribution.
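
The Score, Query Match and Best Local Similarity values printed for each result can be cross-checked against the Matches/Mismatches/Indels/Gaps line. The sketch below is not the program's own code. It takes the gap costs from the header (Gapop 10.0, Gapext 1.0) and assumes a mismatch penalty of 0.6, a value that is not stated anywhere in the report but is inferred here because it reproduces the printed scores of the ungapped and gapped alignments listed earlier. The function name and return keys are illustrative.

```python
def alignment_stats(matches, mismatches, indels, gaps,
                    perfect_score=1743,
                    mismatch_penalty=0.6,  # assumption inferred from printed scores
                    gap_open=10.0,         # "Gapop 10.0" from the header
                    gap_ext=1.0):          # "Gapext 1.0" from the header
    """Recompute the per-result summary figures from the alignment counts."""
    score = (matches
             - mismatch_penalty * mismatches
             - gap_open * gaps
             - gap_ext * indels)
    columns = matches + mismatches + indels  # aligned columns, gaps included
    return {
        "score": round(score, 1),
        "query_match_pct": round(100 * score / perfect_score, 1),
        "similarity_pct": round(100 * matches / columns, 1),
    }


# Counts copied from results listed earlier in this report.
print(alignment_stats(239, 273, 7, 3))   # US-10-162-012-28
print(alignment_stats(132, 138, 1, 1))   # US-09-557-884-1
print(alignment_stats(1743, 0, 0, 0))    # a full-length identical hit
```

For the first two calls the recomputed values agree with the printed lines (Score 38.2, Query Match 2.2%, Best Local Similarity 46.1% and 48.7%). Results with ambiguity codes, which report a nonzero Conservative count, are not covered by this simple model.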

SUMMARIES 



Result          Query
No.    Score    Match    Length    DB    ID    Description



c 
c 



c 
c 



c 
c 



c 
c 



1 

2 
3 
4 
5 
6 
7 
8 
9 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 

21 

22 

23 

24 

25 

26 

27 

28 

29 

30 

31 
32 
33 
34 
35 
36 
37 
38 
39 
40 
41 
42 
43 
44 
45 



1743 
1743 
1394.2 
1375 
1375 
630.8 
630.8 
376.6 
363.8 
242.6 
180.8 
180.8 
155 
118.6 
118.6 
72 
60 
53.8 
41.4 
41 
41 
39.8 
39.8 
39.6 
39.6 
39 
39 
39 
39 
39.2 
38.2 
38.2 
38 
38 
38 
38 
38 
38 



.6 
.4 
. 4 
.2 



100.0 
100.0 
80.0 
78.9 
78.9 



.2 
.2 
.2 
.2 
.2 
.2 
38 
38 
38 
38 
37.8 
37.6 
37.6 



1743 
1813 
4904 

1743 
1743 



36.2 119040 
36.2 142299 
1833 
1985 
1461 
119040 
142299 
455 



21.6 
20-9 
13.9 

10 
10 

8 

6 

6 

4 

3 

3 

2 

2 



2.4 
2.3 



2. 

2. 
2, 

2. 
2 , 
2, 



2.2 
2.2 
2.2 
2.2 
2.2 
2.2 
2.2 
2.2 



943 
943 
96 
60 
65 
61396 
1857 
2839 
666 
666 
2028 
2456 
2475 
578 
578 
1173 
1254 
2028 
2028 
2028 
2326 
2326 
2326 



2.2 1830121 
2.2 1830121 
650 



2 
2 
2 

,2 
.2 
,2 
.2 



650 
1932 
4936 
6306 
867 
867 



10 US-09-911-077A-1 
10 US-09-911-077A-9 
10 US-09-911-077A-5 
10 US-09-911-077A-3
10 US-09-911-077A-23 
10 US-09-911-077A-19 
10 US-09-911-077A-14 
13 US-10-241-784-1 
10 US-09-911-077A-7 

9 US-09-974-300-501 

10 US-09-911-077A-19 
10 US-09-911-077A-14 
9 US-09-864-761-1838 

13 US-10-027-632-120553 

16 US-10-027-632-120553 

9 US-09-864-761-18589

10 US-09-908-975-10249 
10 US-09-908-975-26842 

17 US-10-322-281-483
16 US-10-428-868-3 

9 US-09-995-007-1 

13 US-10-027-632-137101 

16 US-10-027-632-137101 

9 US-09-733-630-1 

9 US-09-733-630-3 

13 US-10-302-172-932 

13 US-10-027-632-192644 

16 US-10-027-632-192644 

17 US-10-437-963-19138 
17 US-10-437-963-19139
9 US-09-928-530-3 

15 US-10-162-012-28 

16 US-10-162-102-28 
9 US-09-928-530-1 

15 US-10-162-012-26

16 US-10-162-102-26 

15 US-10-329-960-1 

16 US-10-329-670-1 

13 US-10-027-632-190544 
16 US-10-027-632-190544 
16 US-10-428-868-1 

12 US-09-809-665A-29 
15 US-10-239-676-129 

13 US-10-142-426-20 
15 US-10-123-155-20 
Sequence 19, Appl
Sequence 14, Appl 
Sequence 1, Appli 
Sequence 1, Appli 
Sequence 501, App 
Sequence 19, Appl 
Sequence 14, Appl 
Sequence 1838, Ap 
Sequence 120553, 
Sequence 120553, 
Sequence 18589, A 
Sequence 10249, A 
Sequence 26842, A 
Sequence 483, App
Sequence 3, Appli 
Sequence 1, Appli 
Sequence 137101, 
Sequence 137101, 
Sequence 1, Appli 
Sequence 3, Appli 
Sequence 932, App 
Sequence 192644, 
ALIGNMENTS



RESULT 1 

US-09-911-077A-1 

; Sequence 1, Application US/09911077A 
; Publication No. US20030114399A1 
; GENERAL INFORMATION: 



APPLICANT: BLAKELY, RANDY D. 
APPLICANT: APPARSUNDARAM, SUBRAMANIAM 
APPLICANT: FERGUSON, SHAWN 

TITLE OF INVENTION: HUMAN AND MOUSE CHOLINE TRANSPORTER cDNA 
FILE REFERENCE: VBLT:008US 

CURRENT APPLICATION NUMBER: US/09/911,077A
CURRENT FILING DATE: 2001-07-23
NUMBER OF SEQ ID NOS: 27
SOFTWARE: PatentIn Ver. 2.1
SEQ ID NO 1 
LENGTH: 1743 
TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE:
NAME/KEY: CDS
LOCATION: (1)..(1743)
US-09-911-077A-1 

Query Match 100.0%; Score 1743; DB 10; Length 1743; 

Best Local Similarity 100.0%; Pred. No. 0;

Matches 1743; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy    1 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    1 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60

Qy   61 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 120
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   61 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 120

Qy  121 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 180
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  121 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 180

Qy  181 ACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTAT 240
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  181 ACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTAT 240

Qy  241 GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 300
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  241 GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 300

Qy  301 TTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAA 360
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  301 TTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAA 360

Qy  361 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTC 420
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  361 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTC 420

Qy  421 TGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGAT 480
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  421 TGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGAT 480

Qy  481 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG 540
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  481 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG 540

Qy  541 CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGG 600
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  541 CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGG 600

Qy  601 ATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTG 660
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  601 ATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTG 660

Qy  661 CATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGG 720
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  661 CATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGG 720

Qy  721 CTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGG 780
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  721 CTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGG 780

Qy  781 GTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGG 840
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  781 GTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGG 840

Qy  841 TGCCTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGG 900
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  841 TGCCTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGG 900

Qy  901 AACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTA 960
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  901 AACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTA 960

Qy  961 CCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTT 1020
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  961 CCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTT 1020

Qy 1021 TCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCA 1080
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1021 TCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCA 1080

Qy 1081 CGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTT 1140
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1081 CGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTT 1140

Qy 1141 ATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAA 1200
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1141 ATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAA 1200

Qy 1201 ACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAG 1260
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1201 ACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAG 1260

Qy 1261 CTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTT 1320
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1261 CTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTT 1320

Qy 1321 TCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATC 1380
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1321 TCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATC 1380

Qy 1381 TTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAATTTCCATTTAAA 1440
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1381 TTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAATTTCCATTTAAA 1440

Qy 1441 ACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTAT 1500
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1441 ACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTAT 1500

Qy 1501 CTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGA 1560
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1501 CTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGA 1560

Qy 1561 CACAGTGAAGAAAACATGGATAAGACAATTCTTGTCAAAAATGAAAATATTAAATTAGAT 1620
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1561 CACAGTGAAGAAAACATGGATAAGACAATTCTTGTCAAAAATGAAAATATTAAATTAGAT 1620

Qy 1621 GAACTTGCACTTGTGAAGCCACGACAGAGCATGACCCTCAGCTCAACTTTCACCAATAAA 1680
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1621 GAACTTGCACTTGTGAAGCCACGACAGAGCATGACCCTCAGCTCAACTTTCACCAATAAA 1680

Qy 1681 GAGGCCTTCCTTGATGTTGATTCCAGTCCAGAAGGGTCTGGGACTGAAGATAATTTACAG 1740
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 1681 GAGGCCTTCCTTGATGTTGATTCCAGTCCAGAAGGGTCTGGGACTGAAGATAATTTACAG 1740

Qy 1741 TGA 1743
        |||
Db 1741 TGA 1743

RESULT 2 

US-09-911-077A-9 

Sequence 9, Application US/09911077A 
Publication No. US20030114399A1 
GENERAL INFORMATION: 
APPLICANT: BLAKELY, RANDY D. 
APPLICANT: APPARSUNDARAM, SUBRAMANIAM 
APPLICANT: FERGUSON, SHAWN 

TITLE OF INVENTION: HUMAN AND MOUSE CHOLINE TRANSPORTER cDNA 
FILE REFERENCE: VBLT:008US 

CURRENT APPLICATION NUMBER: US/09/911,077A
CURRENT FILING DATE: 2001-07-23
NUMBER OF SEQ ID NOS: 27
SOFTWARE: PatentIn Ver. 2.1 
SEQ ID NO 9 
LENGTH: 1813 
TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE:
NAME/KEY: CDS
LOCATION: (19)..(1761)
US-09-911-077A-9 

Query Match 100.0%; Score 1743; DB 10; Length 1813; 

Best Local Similarity 100.0%; Pred. No. 0;

Matches 1743; Conservative 0; Mismatches 0; Indels 0; Gaps 0;



Qy      1 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     19 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 78

Qy     61 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     79 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 138

Qy    121 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 180
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    139 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 198

Qy    181 ACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTAT 240
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    199 ACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTAT 258

Qy    241 GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 300
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    259 GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 318

Qy    301 TTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAA 360
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    319 TTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAA 378

Qy    361 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTC 420
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    379 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTC 438

Qy    421 TGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGAT 480
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    439 TGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGAT 498

Qy    481 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG 540
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    499 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG 558

Qy    541 CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGG 600
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    559 CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGG 618

Qy    601 ATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTG 660
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    619 ATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTG 678

Qy    661 CATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGG 720
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    679 CATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGG 738

Qy    721 CTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGG 780
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    739 CTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGG 798

Qy    781 GTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGG 840
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    799 GTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGG 858

Qy    841 TGCCTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGG 900
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    859 TGCCTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGG 918

Qy    901 AACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTA 960
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    919 AACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTA 978

Qy    961 CCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTT 1020
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    979 CCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTT 1038

Qy   1021 TCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCA 1080
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1039 TCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCA 1098

Qy   1081 CGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTT 1140
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1099 CGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTT 1158

Qy   1141 ATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAA 1200
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1159 ATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAA 1218

Qy   1201 ACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAG 1260
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1219 ACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAG 1278

Qy   1261 CTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTT 1320
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1279 CTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTT 1338

Qy   1321 TCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATC 1380
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1339 TCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATC 1398

Qy   1381 TTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAATTTCCATTTAAA 1440
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1399 TTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAATTTCCATTTAAA 1458

Qy   1441 ACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTAT 1500
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1459 ACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTAT 1518

Qy   1501 CTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGA 1560
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1519 CTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGA 1578

Qy   1561 CACAGTGAAGAAAACATGGATAAGACAATTCTTGTCAAAAATGAAAATATTAAATTAGAT 1620
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1579 CACAGTGAAGAAAACATGGATAAGACAATTCTTGTCAAAAATGAAAATATTAAATTAGAT 1638

Qy   1621 GAACTTGCACTTGTGAAGCCACGACAGAGCATGACCCTCAGCTCAACTTTCACCAATAAA 1680
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1639 GAACTTGCACTTGTGAAGCCACGACAGAGCATGACCCTCAGCTCAACTTTCACCAATAAA 1698

Qy   1681 GAGGCCTTCCTTGATGTTGATTCCAGTCCAGAAGGGTCTGGGACTGAAGATAATTTACAG 1740
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   1699 GAGGCCTTCCTTGATGTTGATTCCAGTCCAGAAGGGTCTGGGACTGAAGATAATTTACAG 1758

Qy   1741 TGA 1743
          |||
Db   1759 TGA 1761



RESULT 3 

US-09-911-077A-5 

Sequence 5, Application US/09911077A 
Publication No. US20030114399A1 
GENERAL INFORMATION: 
APPLICANT: BLAKELY, RANDY D.
APPLICANT: APPARSUNDARAM, SUBRAMANIAM
APPLICANT: FERGUSON, SHAWN

TITLE OF INVENTION: HUMAN AND MOUSE CHOLINE TRANSPORTER cDNA
FILE REFERENCE: VBLT:008US 

CURRENT APPLICATION NUMBER: US/09/911,077A
CURRENT FILING DATE: 2001-07-23 
NUMBER OF SEQ ID NOS: 27 
SOFTWARE: PatentIn Ver. 2.1 
SEQ ID NO 5 
LENGTH: 4904 
TYPE: DNA

ORGANISM: Rattus norvegicus
FEATURE:
NAME/KEY: CDS
LOCATION: (224)..(1966)
US-09-911-077A-5 

Query Match 80.0%; Score 1394.2; DB 10; Length 4904;

Best Local Similarity 87.5%; Pred. No. 0;

Matches 1525; Conservative 0; Mismatches 218; Indels 0; Gaps 0;

ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTT^ 60 

GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGC^ 120 
lUiiiiiiiriii^^^^^ 343 

.ccatca™ 

iiiiiiiiiGUGUGciGi^ciiiGGTTTGTTGGTTGGTGGTTTT^ 
ACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTAC^ 

, ,1 I I I I I I I I II I I I I I II M I I M I I I I I I I I I I I MINI' I 

IcciGGiiTGiiUiiiTiUiiiicGGGACAGC^ ^^3 

GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 300 

?? Mmilllllillllllll IIIIIIMIIIMI IIIIMIIIIIIIMMIMI 

GGTciiiciiGiiciiJvGGciciciTUGATATTCTCTGAGTCTGATTTTAGG^ 523 
TTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACC^ 
iiTiiiGiJJJJvciiiiGciiicciiGGiAiiiiTG 



Qy 


1 . 


Db 


224 


Qy 


61 


Db 


284 


Qy 


121 


Db 


344 


Qy 


181 


Db 


404 


Qy 


241 


Db 


464 


Qy 


301 


Db 


524 



180 
403 



361 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTC 

II MM 111 M IIMMM I M Illlllll Ml III 

584 ATCTATGGAAAGCGCATGGGTGGGCTGCTGTTCATCCCTGCACTGATGGGAGAGATGTTC 
421 TGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGAT

IIIIIIIIMIMIIIMMIM M M 11 Ml II 111 111 Mill II Ml I 111 

644 TGGGCTGCAGCAATTTTCTCTGCATTAGGGGCTACCATCAGCGTAATCATTGATGTGGAT 

481 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG

I, nil II 111 II MM IMIMMIM II M II I MM III I 

704 GTGAACATATCGGTCATTGTCTCCGCACTCATTGCCATTCTTTATACCCTCGTGGGAGGG 



541 
764 
601 
824 
661 



CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGG 

I II II I II 11 II II I I M 11 II 1 11 M M II 11 I MM Mill 

CTCTACTCTGTGGCATATACTGATGTTGTACAGCTATTCTGCATTTTTATAGGATTGTGG 
ATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTG 

lllli ill M MMMMMIMMll 1 Mill M IMMMIMII 

ATCAGTGTCCCATTTGCCCTGTCACATCCTGCAGTCACCGACATTGGATTCACTGCTGTG 



CATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGG 

11111111111111 11 II 11 I I 1 I I II 1 II 1 I M 11 M I 11 I 

884 CATGCTAAATACCAGAGTCCCTGGCTGGGAACCATTGAATCAGTTGAAGTCTACACCTGG 



721 CTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGG 

lllllii IIIIIMMMMMMMMMMM IIIIMIIlll Mill MMM 
944 CTTGATAATTTTCTGTTGTTGATGCTGGGTGGAATACCATGGCAAGCCTACTTCCAGAGG 

781 GTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGG

I, IIMMM 11 Mill IIIIMIMII MIIMlllllllllllMIIII Ml 
1004 GTCCTCTCTTCATCGTCAGCGACCTATGCTCAGGTGCTGTCCTTCCTGGCAGCTTTTGGG 

841 TGCCTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGG

I I nil III Ml II I IIMMM 111 Illlllll IMMII II I II I MM I 

1064 TGCCTGGTGATGGCTCTACCAGCCATTTGCATTGGGGCCATTGGAGCCTCCACAGACTGG 

901 AACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTA

II n I MM 111 MM Ml Ml MM 11 I M II I mil MM 11 I 

1124 AACCAAACTGCATATGGGTTTCCAGATCCCAAGACCAAGGAGGAAGCAGACATGATTCTC 
961 CCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTT 

II IIMMM Mill IIIIMIIMIMI IMM IIMMM I 1 Ml 

1184 CCGATTGTTCTACAGTACCTCTGCCCTGTGTACATTTCCTTCTTTGGGCTTGGTGCTGTT 



420 
643 
480 
703 
540 
763 
600 
823 
660 
883 
720 
943 
780 
1003 
840 
1063 
900 
1123 
960 
1183 
1020 
1243 
1080 



1021 TCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCA 

nillllllll Mill I M Ml 111 1 MMMllllllllMllll 

1244 TCTGCTGCTGTCATGTCCTCGGCTGACTCATCCATCCTATCAGCAAGTTCCATGTTTGCT 1303 



1081 CGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTT 

mil MMMMMMMMMMIMMIMI II Mill Mill M Mill 

1304 CGGAATATCTACCAGCTTTCCTTCAGACAAAATGCATCAGACAAGGAAATTGTGTGGGTC 

1141 ATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAA 

in 1 Mill llllMMl llllllllllMllllllMlllllll Mill 

1364 ATGAGGATCACTGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTCACGAAG



1201 



ACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAG 



1140 
1363 
1200 
1423 
1260 



IMIIIIIIIIIIIMIIIIIII II MMIIIMII IIMII I llllllll III 

Db 1424 ACTGTGTATGGGCTCTGGTACCTGAGCTCTGACCTTGTCTACATCATCATCTTCCCACAG 

Qy 1261 



CTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTT 

I I I I I I I I I I I II I M I M I II II I I II M I I I I M M II II I I II I I II 
Db 1484 CTGCTCTGTGTACTCTTCATCAAAGGAACCAACACTTATGGGGCAGTTGCTGGTTATATT



Qy 1321 
Db 1544 



1483 
1320 
1543 
1380 



TCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATC 

I III II Mlllllllll II INN IIIIIIIIMI 11 I III Mill III 
TTTGGACTTTTCCTGAGAATTACCGGAGGAGAGCCATATCTATACTTGCAGCCCTTAATC 1603 

Qy 1381 TTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAATTTCCATTTAAA 1440

^ IMIIIMMI Mlllllllll I IIIMIIMM IIIIMI II III Ml II I 

Db 1604 TTCTACCCTGGTTATTACCCTGACAAGAATGGTATATACAATCAGAGGTTCCCATTTAAA 



Qy 1441 ACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTAT

II II MIIMIIM MIIIIM II M I IIMMMIIIIMIIII 

Db 1664 ACTCTCTCCATGGTTACCTCATTCTTTACCAACATTTGTGTTTCCTATCTAGCCAAGTAT 



Qy 1501 
Db 1724 



CTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGA 

IIMMMMMIMMMMII M IIIIMIM MIIIIIMIMMM MM 

CTATTTGAAAGTGGAACCTTGCCTCCAAAATTAGATATATTTGATGCTGTTGTCTCAAGG 



Qy 1561 CACAGTGAAGAAAACATGGATAAGACAATTCTTGTCAAAAATGAAAATATTAAATTAGAT

^ Mill IIMII MIIIIM Mill IM II MM lllllllll II IIMII II 

Db 1784 CACAGTGAAGAGAACATGGACAAGACCATTCTAGTCAGAAATGAAAACATCAAATTAAAT 



Qy 1621 

Db 1844 

Qy 1681 

Db 1904 



GAACTTGCACTTGTGAAGCCACGACAGAGCATGACCCTCAGCTCAACTTTCACCAATAAA 

IIIIIIMII Ml Mill lllllllll I MIMIII IIMIMIIIIIIMIII 

GAACTTGCACCTGTAAAGCCTCGACAGAGCCTAACCCTCAGTTCAACTTTCACCAATAAA 
GAGGCCTTCCTTGATGTTGATTCCAGTCCAGAAGGGTCTGGGACTGAAGATAATTTACAG 

I I I II I II I M M I M M M I I M M II I I M II M I II II II II II II Mill 

GAGGCTCTCCTTGATGTTGATTCCAGTCCAGAGGGATCTGGGACTGAAGATAACTTACAA 



1663 
1500 
1723 
1560 
1783 
1620 
1843 
1680 
1903 
1740 
1963 



Qy 1741 TGA 1743 

I I I 

Db 1964 TGA 1966 



RESULT 4 

US-09-911-077A-3 

; Sequence 3, Application US/09911077A 

; Publication No. US20030114399A1 

; GENERAL INFORMATION: 

; APPLICANT: BLAKELY, RANDY D. 

; APPLICANT: APPARSUNDARAM, SUBRAMANIAM 

; APPLICANT: FERGUSON, SHAWN 

; TITLE OF INVENTION: HUMAN AND MOUSE CHOLINE TRANSPORTER cDNA 
; FILE REFERENCE: VBLT:008US 

; CURRENT APPLICATION NUMBER: US/09/911,077A
; CURRENT FILING DATE: 2001-07-23
; NUMBER OF SEQ ID NOS: 27
; SOFTWARE: PatentIn Ver. 2.1
; SEQ ID NO 3 
; LENGTH: 1743 
TYPE: DNA
ORGANISM: Mus musculus
FEATURE:
NAME/KEY: CDS
LOCATION: (1)..(1743)
US-09-911-077A-3 

Query Match 78.9%; Score 1375; DB 10; Length 1743; 

Best Local Similarity 86.8%; Pred. No. 0; 

Matches 1513; Conservative 0; Mismatches 230; Indels 0; Gaps 0; 



Qy      1 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60
          ||| |||||||||||||||||||| ||||||| ||| | |||||||| || || || |||
Db      1 ATGCCTTTCCATGTGGAAGGACTGGTAGCTATTATCCTCTTCTACCTCCTTATATTTCTG 60

Qy     61 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 120
          ||||||||||||||||| |||| |||||||||||| |||| | ||||||||||||| |||
Db     61 GTTGGAATATGGGCTGCATGGAAAACCAAAAACAGCGGCAACCCAGAAGAGCGCAGTGAA 120

Qy    121 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 180
          ||||||||||| || ||||| || |||||||| ||||||||||| ||||||||||||||
Db    121 GCCATCATAGTCGGGGGCCGTGACATTGGTTTGTTGGTTGGTGGTTTTACCATGACAGCC 180

Qy    181 ACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTAT 240
          |||||||| |||||||| || |||||||| ||||| |||||||| ||||  ||||||| |
Db    181 ACCTGGGTTGGAGGAGGCTACATCAATGGGACAGCAGAAGCAGTGTATGGGCCAGGTTGT 240

Qy    241 GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 300
          || |||||||||||||| ||||| |||||||||||||| ||||| ||||||||||| |||
Db    241 GGTCTAGCTTGGGCTCATGCACCCATTGGATATTCTCTGAGTCTAATTTTAGGTGGTCTG 300

Qy    301 TTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAA 360
          || ||||| |||||||||||||| ||||| |||||||| ||||||||||| ||  | ||
Db    301 TTTTTTGCGAAACCTATGCGTTCCAAGGGATATGTGACTATGTTAGACCCATTCAAACAG 360

Qy    361 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTC 420
          ||||||||||| |||||||| || || || || || ||||||||||||||||| ||||||
Db    361 ATCTATGGAAAGCGCATGGGTGGGCTGCTCTTCATCCCTGCACTGATGGGAGAGATGTTC 420

Qy    421 TGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGAT 480
          ||||||||||||||||||||||| || || |||||||||||||||||||| |||||||||
Db    421 TGGGCTGCAGCAATTTTCTCTGCATTAGGGGCCACCATCAGCGTGATCATTGATGTGGAT 480

Qy    481 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG 540
           || |||| || |||||  |||||||||||||||||| ||| || || || ||||| |||
Db    481 GTGAACATATCGGTCATTGTCTCTGCACTCATTGCCATTCTTTATACCCTAGTGGGTGGG 540

Qy    541 CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGG 600
          ||||| |||||||| || |||||||| || ||||| || ||||||||| |||| ||||||
Db    541 CTCTACTCTGTGGCATATACTGATGTTGTCCAGCTATTCTGCATTTTTATAGGACTGTGG 600

Qy    601 ATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTG 660
          ||||| ||||| |||||  ||||||||||||||||| | |||||||| ||||| ||||||
Db    601 ATCAGTGTCCCTTTTGCCCTGTCACATCCTGCAGTCACCGACATCGGATTCACAGCTGTG 660

Qy    661 CATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGG 720
          ||||| |||||||| |  || |||||||||||  |||| |||  |||||||||| | |||
Db    661 CATGCTAAATACCAGAGTCCCTGGCTGGGAACCATTGAATCAGTTGAAGTCTACACCTGG 720

Qy    721 CTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGG 780
          ||||||| ||||||||| ||||||||||||||||||||||||||||| ||||| ||||||
Db    721 CTTGATAATTTTCTGTTATTGATGCTGGGTGGAATCCCATGGCAAGCCTACTTCCAGAGG 780

Qy    781 GTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGG 840
          || |||||||| |||||||||||||||||||| || |||||||||||||||||||| |||
Db    781 GTCCTCTCTTCATCCTCAGCCACCTATGCTCAGGTACTGTCCTTCCTGGCAGCTTTTGGG 840

Qy    841 TGCCTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGG 900
          ||||||||||||||  | || ||||||  ||| || || |||||||| || |||||||||
Db    841 TGCCTGGTGATGGCTCTACCCGCCATATGCATAGGAGCTATTGGAGCTTCCACAGACTGG 900

Qy    901 AACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTA 960
          ||||||||||| || |||  |||||||||||||||||  || || |||||||||||| |
Db    901 AACCAGACTGCCTACGGGTATCCAGATCCCAAGACTAAGGAGGAAGCAGACATGATTCTC 960

Qy    961 CCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTT 1020
          || || ||||||||||| |||||||||||||| || || |||||||| |||||||| |||
Db    961 CCGATCGTTCTGCAGTACCTCTGCCCTGTGTACATCTCCTTCTTTGGGCTTGGTGCTGTT 1020

Qy   1021 TCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCA 1080
          || |||||||| ||||| ||||| || || |||||| |||| || ||||| ||||||||
Db   1021 TCAGCTGCTGTCATGTCCTCAGCTGACTCGTCCATCCTGTCGGCGAGTTCTATGTTTGCT 1080

Qy   1081 CGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTT 1140
          ||||| ||||||||||||||||||||||||||||| || ||||| ||||| || |||||
Db   1081 CGGAATATCTACCAGCTTTCCTTCAGACAAAATGCATCAGACAAGGAAATTGTGTGGGTC 1140

Qy   1141 ATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAA 1200
          ||| | ||||| ||| ||||||| ||||||||||||||||||||||| |||||||||||
Db   1141 ATGAGGATCACTGTGCTTGTGTTCGGAGCATCTGCAACAGCCATGGCTTTGCTGACGAAG 1200

Qy   1201 ACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAG 1260
          ||||||||||||||||||||||| || ||||||||||| |||||| | |||||||| |||
Db   1201 ACTGTGTATGGGCTCTGGTACCTGAGCTCTGACCTTGTCTACATCATCATCTTCCCACAG 1260

Qy   1261 CTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTT 1320
          ||||| |||||||||||  | || ||||||||||| |||||||| || || |||||| ||
Db   1261 CTGCTCTGTGTACTCTTCATCAAAGGAACCAACACTTATGGGGCAGTTGCTGGTTATATT 1320

Qy   1321 TCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATC 1380
          | ||| || ||||||||||| |||||||| ||||||||||| ||  | |||||||| |||
Db   1321 TTTGGACTATTCCTGAGAATTACTGGAGGAGAGCCATATCTATACTTGCAGCCCTTAATC 1380

Qy   1381 TTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAATTTCCATTTAAA 1440
          ||||||||||| |||||| ||||  | ||||||||||| |||||||  || |||||||||
Db   1381 TTCTACCCTGGTTATTACTCTGACAAGAATGGTATATACAATCAGAGGTTCCCATTTAAA 1440

Qy   1441 ACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTAT 1500
          || ||  |||||||||| |||||||| |||||||||||  | || |||||||||||||||
Db   1441 ACTCTCTCCATGGTTACCTCATTCTTTACCAACATTTGTGTTTCTTATCTAGCCAAGTAT 1500

Qy   1501 CTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGA 1560
          ||||||||||||||||||||||| || |||||||||||||||||||||||||| |||||
Db   1501 CTATTTGAAAGTGGAACCTTGCCTCCAAAATTAGATGTATTTGATGCTGTTGTCGCAAGG 1560

Qy   1561 CACAGTGAAGAAAACATGGATAAGACAATTCTTGTCAAAAATGAAAATATTAAATTAGAT 1620
          ||||||||||| |||||||| ||||| ||||| |||| |||||||||||| |||||| ||
Db   1561 CACAGTGAAGAGAACATGGACAAGACCATTCTAGTCAGAAATGAAAATATCAAATTAAAT 1620

Qy   1621 GAACTTGCACTTGTGAAGCCACGACAGAGCATGACCCTCAGCTCAACTTTCACCAATAAA 1680
          |||||||||| |||||| || || |||||| | |||||||| |||||||||||||||||
Db   1621 GAACTTGCACCTGTGAAACCTCGGCAGAGCCTAACCCTCAGTTCAACTTTCACCAATAAG 1680

Qy   1681 GAGGCCTTCCTTGATGTTGATTCCAGTCCAGAAGGGTCTGGGACTGAAGATAATTTACAG 1740
          |||||| |||||||||||||||||||||| || ||||||||||||||||||||||||||
Db   1681 GAGGCCCTCCTTGATGTTGATTCCAGTCCGGAGGGGTCTGGGACTGAAGATAATTTACAA 1740

Qy   1741 TGA 1743
          |||
Db   1741 TGA 1743



RESULT 5 

US-09-911-077A-23 

Sequence 23, Application US/09911077A 
Publication No. US20030114399A1 
GENERAL INFORMATION: 
APPLICANT: BLAKELY, RANDY D. 
APPLICANT: APPARSUNDARAM, SUBRAMANIAM 
APPLICANT: FERGUSON, SHAWN 

TITLE OF INVENTION: HUMAN AND MOUSE CHOLINE TRANSPORTER cDNA 
FILE REFERENCE: VBLT:008US 

CURRENT APPLICATION NUMBER: US/09/911,077A
CURRENT FILING DATE: 2001-07-23
NUMBER OF SEQ ID NOS: 27
SOFTWARE: PatentIn Ver. 2.1
SEQ ID NO 23 
LENGTH: 1743 
TYPE: DNA 

ORGANISM: Mus musculus
FEATURE:
NAME/KEY: CDS
LOCATION: (1)..(1743)
US-09-911-077A-23 

Query Match 78.9%; Score 1375; DB 10; Length 1743; 

Best Local Similarity 86.8%; Pred. No. 0; 

Matches 1513; Conservative 0; Mismatches 230; Indels 0; Gaps 0;



Qy      1 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60
          ||| |||||||||||||||||||| ||||||| ||| | |||||||| || || || |||
Db      1 ATGCCTTTCCATGTGGAAGGACTGGTAGCTATTATCCTCTTCTACCTCCTTATATTTCTG 60

Qy     61 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 120
          ||||||||||||||||| |||| |||||||||||| |||| | ||||||||||||| |||
Db     61 GTTGGAATATGGGCTGCATGGAAAACCAAAAACAGCGGCAACCCAGAAGAGCGCAGTGAA 120

Qy    121 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 180
          ||||||||||| || ||||| || |||||||| ||||||||||| ||||||||||||||
Db    121 GCCATCATAGTCGGGGGCCGTGACATTGGTTTGTTGGTTGGTGGTTTTACCATGACAGCC 180

Qy    181 ACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTAT 240
          |||||||| |||||||| || |||||||| ||||| |||||||| ||||  ||||||| |
Db    181 ACCTGGGTTGGAGGAGGCTACATCAATGGGACAGCAGAAGCAGTGTATGGGCCAGGTTGT 240

Qy    241 GGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTG 300
          || |||||||||||||| ||||| |||||||||||||| ||||| ||||||||||| |||
Db    241 GGTCTAGCTTGGGCTCATGCACCCATTGGATATTCTCTGAGTCTAATTTTAGGTGGTCTG 300

Qy    301 TTCTTTGCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAA 360
          || ||||| |||||||||||||| ||||| |||||||| ||||||||||| ||  | ||
Db    301 TTTTTTGCGAAACCTATGCGTTCCAAGGGATATGTGACTATGTTAGACCCATTCAAACAG 360

Qy    361 ATCTATGGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTC 420
          ||||||||||| |||||||| || || || || || ||||||||||||||||| ||||||
Db    361 ATCTATGGAAAGCGCATGGGTGGGCTGCTCTTCATCCCTGCACTGATGGGAGAGATGTTC 420

Qy    421 TGGGCTGCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGAT 480
          ||||||||||||||||||||||| || || |||||||||||||||||||| |||||||||
Db    421 TGGGCTGCAGCAATTTTCTCTGCATTAGGGGCCACCATCAGCGTGATCATTGATGTGGAT 480

Qy    481 ATGCACATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGG 540
           || |||| || |||||  |||||||||||||||||| ||| || || || ||||| |||
Db    481 GTGAACATATCGGTCATTGTCTCTGCACTCATTGCCATTCTTTATACCCTAGTGGGTGGG 540

Qy    541 CTCTATTCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGG 600
          ||||| |||||||| || |||||||| || ||||| || ||||||||| |||| ||||||
Db    541 CTCTACTCTGTGGCATATACTGATGTTGTCCAGCTATTCTGCATTTTTATAGGACTGTGG 600

Qy    601 ATCAGCGTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTG 660
          ||||| ||||| |||||  ||||||||||||||||| | |||||||| ||||| ||||||
Db    601 ATCAGTGTCCCTTTTGCCCTGTCACATCCTGCAGTCACCGACATCGGATTCACAGCTGTG 660

Qy    661 CATGCCAAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGG 720
          ||||| |||||||| |  || |||||||||||  |||| |||  |||||||||| | |||
Db    661 CATGCTAAATACCAGAGTCCCTGGCTGGGAACCATTGAATCAGTTGAAGTCTACACCTGG 720

Qy    721 CTTGATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGG 780
          ||||||| ||||||||| ||||||||||||||||||||||||||||| ||||| ||||||
Db    721 CTTGATAATTTTCTGTTATTGATGCTGGGTGGAATCCCATGGCAAGCCTACTTCCAGAGG 780

Qy    781 GTTCTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGG 840
          || |||||||| |||||||||||||||||||| || |||||||||||||||||||| |||
Db    781 GTCCTCTCTTCATCCTCAGCCACCTATGCTCAGGTACTGTCCTTCCTGGCAGCTTTTGGG 840

Qy    841 TGCCTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGG 900
          ||||||||||||||  | || ||||||  ||| || || |||||||| || |||||||||
Db    841 TGCCTGGTGATGGCTCTACCCGCCATATGCATAGGAGCTATTGGAGCTTCCACAGACTGG 900

Qy    901 AACCAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTA 960
          ||||||||||| || |||  |||||||||||||||||  || || |||||||||||| |
Db    901 AACCAGACTGCCTACGGGTATCCAGATCCCAAGACTAAGGAGGAAGCAGACATGATTCTC 960

Qy    961 CCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTT 1020
          || || ||||||||||| |||||||||||||| || || |||||||| |||||||| |||
Db    961 CCGATCGTTCTGCAGTACCTCTGCCCTGTGTACATCTCCTTCTTTGGGCTTGGTGCTGTT 1020

Qy   1021 TCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCA 1080
          || |||||||| ||||| ||||| || || |||||| |||| || ||||| ||||||||
Db   1021 TCAGCTGCTGTCATGTCCTCAGCTGACTCGTCCATCCTGTCGGCGAGTTCTATGTTTGCT 1080

Qy   1081 CGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTT 1140
          ||||| ||||||||||||||||||||||||||||| || ||||| ||||| || |||||
Db   1081 CGGAATATCTACCAGCTTTCCTTCAGACAAAATGCATCAGACAAGGAAATTGTGTGGGTC 1140

Qy   1141 ATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAA 1200
          ||| | ||||| ||| ||||||| ||||||||||||||||||||||| |||||||||||
Db   1141 ATGAGGATCACTGTGCTTGTGTTCGGAGCATCTGCAACAGCCATGGCTTTGCTGACGAAG 1200

Qy   1201 ACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAG 1260
          ||||||||||||||||||||||| || ||||||||||| |||||| | |||||||| |||
Db   1201 ACTGTGTATGGGCTCTGGTACCTGAGCTCTGACCTTGTCTACATCATCATCTTCCCACAG 1260

Qy   1261 CTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTT 1320
          ||||| |||||||||||  | || ||||||||||| |||||||| || || |||||| ||
Db   1261 CTGCTCTGTGTACTCTTCATCAAAGGAACCAACACTTATGGGGCAGTTGCTGGTTATATT 1320

Qy   1321 TCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATC 1380
          | ||| || ||||||||||| |||||||| ||||||||||| ||  | |||||||| |||
Db   1321 TTTGGACTATTCCTGAGAATTACTGGAGGAGAGCCATATCTATACTTGCAGCCCTTAATC 1380

Qy   1381 TTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAATTTCCATTTAAA 1440
          ||||||||||| |||||| ||||  | ||||||||||| |||||||  || |||||||||
Db   1381 TTCTACCCTGGTTATTACTCTGACAAGAATGGTATATACAATCAGAGGTTCCCATTTAAA 1440

Qy   1441 ACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTAT 1500
          || ||  |||||||||| |||||||| |||||||||||  | || |||||||||||||||
Db   1441 ACTCTCTCCATGGTTACCTCATTCTTTACCAACATTTGTGTTTCTTATCTAGCCAAGTAT 1500

Qy   1501 CTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGTTGCAAGA 1560
          ||||||||||||||||||||||| || |||||||||||||||||||||||||| |||||
Db   1501 CTATTTGAAAGTGGAACCTTGCCTCCAAAATTAGATGTATTTGATGCTGTTGTCGCAAGG 1560

Qy   1561 CACAGTGAAGAAAACATGGATAAGACAATTCTTGTCAAAAATGAAAATATTAAATTAGAT 1620
          ||||||||||| |||||||| ||||| ||||| |||| |||||||||||| |||||| ||
Db   1561 CACAGTGAAGAGAACATGGACAAGACCATTCTAGTCAGAAATGAAAATATCAAATTAAAT 1620

Qy   1621 GAACTTGCACTTGTGAAGCCACGACAGAGCATGACCCTCAGCTCAACTTTCACCAATAAA 1680
          |||||||||| |||||| || || |||||| | |||||||| |||||||||||||||||
Db   1621 GAACTTGCACCTGTGAAACCTCGGCAGAGCCTAACCCTCAGTTCAACTTTCACCAATAAG 1680

Qy   1681 GAGGCCTTCCTTGATGTTGATTCCAGTCCAGAAGGGTCTGGGACTGAAGATAATTTACAG 1740
          |||||| |||||||||||||||||||||| || ||||||||||||||||||||||||||
Db   1681 GAGGCCCTCCTTGATGTTGATTCCAGTCCGGAGGGGTCTGGGACTGAAGATAATTTACAA 1740

Qy   1741 TGA 1743
          |||
Db   1741 TGA 1743



RESULT 6 

US-09-911-077A-19 

; Sequence 19, Application US/09911077A 

; Publication No. US20030114399A1 

; GENERAL INFORMATION: 

; APPLICANT: BLAKELY, RANDY D. 



APPLICANT: APPARSUNDARAM, SUBRAMANIAM 
APPLICANT: FERGUSON, SHAWN 

TITLE OF INVENTION: HUMAN AND MOUSE CHOLINE TRANSPORTER cDNA 
FILE REFERENCE: VBLT:008US 

CURRENT APPLICATION NUMBER: US/09/911,077A
CURRENT FILING DATE: 2001-07-23
NUMBER OF SEQ ID NOS: 27
SOFTWARE: PatentIn Ver. 2.1 
SEQ ID NO 19 
LENGTH: 119040 
TYPE: DNA

ORGANISM: Homo sapiens 
FEATURE: 

NAME/KEY: modified_base
LOCATION: (2347)..(90873)
OTHER INFORMATION: N = A, C, G or T/U 
US-09-911-077A-19 

Query Match 36.2%; Score 630.8; DB 10; Length 119040; 

Best Local Similarity 99.7%; Pred. No. 1.2e-178;

Matches 632; Conservative 0; Mismatches 2; Indels 0; Gaps 0;

AAATGCTTCGGACAAAGAAATCGTTTGGGTTATGCGAATCACAGTGTTTGTGTTTGGAGC 1169 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I M I I I I I I N I I I I I I 
UGiciiiiiiiZiiiiicGiiiiiGTTATGCGAATCACAGTGTTTGT 30814 

ATCTGCAACAGCCATGGCCTTGCTGACGAAAACTGTGTATGGGCTCTGGTACCTCAGTTC 1229 

iicTGCAACAGCCAiGGCCTTGCTGACGA^^ 30874 
TGACCTTGTTTACATCGTTATCTTCCCCCAGCTGCTTTGTGTACTCTTTGTTAAGGGAAC 1289 

I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I N I I N II I II I I II 11 I I I I I I I I I 

i-y^icUiiUiiiiciUAiiTTCcicCAGCTGOT 30934 
CAACACCTATGGGGCCGTGGCAGGTTATGTTTCTGGCCTCTTCCTGAGAATAACTGGAGG 1349 

I I 111 I III MM 111 I Ml I MM IIIIMII MM Mill I 111 III I MM I II II I 

iiACACCTATGGGGCCGTGGCA^^^ 30994 

GGAGCCATATCTGTATCTTCAGCCCTTGATCTTCTACCCTGGCTATTACCCTGATGATAA 1409 
I I I I I I I I I I I I I I M II II II II I M II I II I I I II M I I I M II I M II I M I II I I I 
iGiii(!J.iiiiiiTiicTiciGiicTi^ 

TGGTATATATAATCAGAAATTTCCATTTAAAACACTTGCCATGGTTACATCATTCTTAAC 1469 

I I II M I II I M I I M I M I II II I I M M 1 I I II I I II It I I M II II I M I I I M I I I 

isCTATATATAATCAGAiJvTTTCCATTTAAW^^ 31114 

1470 CAACATTTGCATCTCCTATCTAGCCAAGTATCTATTTGAAAGTGGAACCTTGCCACCTAA 1529 

1470 CAACATTTG M I I I I I I I I 11 M I II II M I M I I I I I II 

CAACATTTGCATCTCCTATCTAGCCAAGTATCTATTTGAAAGTGGAACCTTGCCACCTAA 31174 

ATTAGATGTATTTGATGCTGTTGTTGCAAGACACAGTGAAGAAAACATGGATAAGACAAT 1589 

II 11 I I I I I M II 11 M I II I 11 I M M II M I I I II I I II I I I I I M M 11 II I M I I I 

aUaGATGTATTTGATGCTGTTGTTGCAAGACACAGTGAAG^ 31234 

TCTTGTCAAAAATGAAAATATTAAATTAGATGAACTTGCACTTGTGAAGCCACGACAGAG 1649 
I I I I I I I I I I I II I II 1 I M 1 I II I II I I II M II I I II M I 1 II II I I II I 1 I 1 M I M 
icUiici]JJJ.iiJXiAUJJJ.TT^^^^ 3129. 



Qy 1110 
Db 30755 
Qy 1170 
Db 30815 
Qy 1230 
Db 30875 
Qy 1290 
Db 30935 
Qy 1350 
Db 30995 
Qy 1410 
Db 31055 
Qy 1470 
Db 31115 
Qy 1530 
Db 31175 
Qy 1590 
Db 31235 



Qy 1650 CATGACCCTCAGCTCAACTTTCACCAATAAAGAGGCCTTCCTTGATGTTGATTCCAGTCC 1709 

I I I I I I I 1 I I I M I I I M M I 1 I I I I 1 I M M I I I M lillllilliililllllliiil 

Db 31295 



CATGACCCTCAGCTCAACTTTCACCAATAAAGAGGCCTTCCT^ 31354 



Qy 1710 AGAAGGGTCTGGGACTGAAGATAATTTACAGTGA 1743 

I I I I I I I I I I I I I I I I I M I I M M M I I I M M 
Db 31355 AGAAGGGTCTGGGACTGAAGATAATTTACAGTGA 31388 



RESULT 7 

US-09-911-077A-14 

Sequence 14, Application US/09911077A 
Publication No. US20030114399A1 
GENERAL INFORMATION: 
APPLICANT: BLAKELY, RANDY D. 
APPLICANT: APPARSUNDARAM, SUBRAMANIAM 
APPLICANT: FERGUSON, SHAWN 

TITLE OF INVENTION: HUMAN AND MOUSE CHOLINE TRANSPORTER cDNA 
FILE REFERENCE: VBLT:008US 

CURRENT APPLICATION NUMBER: US/09/911,077A 
CURRENT FILING DATE: 2001-07-23 
NUMBER OF SEQ ID NOS: 27 
SOFTWARE: PatentIn Ver. 2.1 
SEQ ID NO 14 

LENGTH: 142299 
TYPE: DNA 

ORGANISM: Artificial Sequence 

FEATURE: 

OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

OTHER INFORMATION: Primer 

FEATURE: 

NAME/KEY: modified_base 
LOCATION: (1305)..(127835) 
OTHER INFORMATION: N = A, C, G or T/U 
US-09-911-077A-14 

Query Match 36.2%; Score 630.8; DB 10; Length 142299; 

Best Local Similarity 99.7%; Pred. No. 1.4e-178; 

Matches 632; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 



Qy  1110 AAATGCTTCGGACAAAGAAATCGTTTGGGTTATGCGAATCACAGTGTTTGTGTTTGGAGC 1169 
         | | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 94673 ACAGGCTTCGGACAAAGAAATCGTTTGGGTTATGCGAATCACAGTGTTTGTGTTTGGAGC 94732 

Qy  1170 ATCTGCAACAGCCATGGCCTTGCTGACGAAAACTGTGTATGGGCTCTGGTACCTCAGTTC 1229 
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 94733 ATCTGCAACAGCCATGGCCTTGCTGACGAAAACTGTGTATGGGCTCTGGTACCTCAGTTC 94792 

Qy  1230 TGACCTTGTTTACATCGTTATCTTCCCCCAGCTGCTTTGTGTACTCTTTGTTAAGGGAAC 1289 
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 94793 TGACCTTGTTTACATCGTTATCTTCCCCCAGCTGCTTTGTGTACTCTTTGTTAAGGGAAC 94852 

Qy  1290 CAACACCTATGGGGCCGTGGCAGGTTATGTTTCTGGCCTCTTCCTGAGAATAACTGGAGG 1349 
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 94853 CAACACCTATGGGGCCGTGGCAGGTTATGTTTCTGGCCTCTTCCTGAGAATAACTGGAGG 94912 

Qy  1350 GGAGCCATATCTGTATCTTCAGCCCTTGATCTTCTACCCTGGCTATTACCCTGATGATAA 1409 
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 94913 GGAGCCATATCTGTATCTTCAGCCCTTGATCTTCTACCCTGGCTATTACCCTGATGATAA 94972 

Qy  1410 TGGTATATATAATCAGAAATTTCCATTTAAAACACTTGCCATGGTTACATCATTCTTAAC 1469 
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 94973 TGGTATATATAATCAGAAATTTCCATTTAAAACACTTGCCATGGTTACATCATTCTTAAC 95032 

Qy  1470 CAACATTTGCATCTCCTATCTAGCCAAGTATCTATTTGAAAGTGGAACCTTGCCACCTAA 1529 
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 95033 CAACATTTGCATCTCCTATCTAGCCAAGTATCTATTTGAAAGTGGAACCTTGCCACCTAA 95092 

Qy  1530 ATTAGATGTATTTGATGCTGTTGTTGCAAGACACAGTGAAGAAAACATGGATAAGACAAT 1589 
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 95093 ATTAGATGTATTTGATGCTGTTGTTGCAAGACACAGTGAAGAAAACATGGATAAGACAAT 95152 

Qy  1590 TCTTGTCAAAAATGAAAATATTAAATTAGATGAACTTGCACTTGTGAAGCCACGACAGAG 1649 
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 95153 TCTTGTCAAAAATGAAAATATTAAATTAGATGAACTTGCACTTGTGAAGCCACGACAGAG 95212 

Qy  1650 CATGACCCTCAGCTCAACTTTCACCAATAAAGAGGCCTTCCTTGATGTTGATTCCAGTCC 1709 
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 95213 CATGACCCTCAGCTCAACTTTCACCAATAAAGAGGCCTTCCTTGATGTTGATTCCAGTCC 95272 

Qy  1710 AGAAGGGTCTGGGACTGAAGATAATTTACAGTGA 1743 
         ||||||||||||||||||||||||||||||||||
Db 95273 AGAAGGGTCTGGGACTGAAGATAATTTACAGTGA 95306 



RESULT 8 
US-10-241-784-1 

Sequence 1, Application US/10241784 
Publication No. US20040048261A1 
GENERAL INFORMATION: 
APPLICANT: Bayer Corporation 

TITLE OF INVENTION: Invertebrate Choline Transporter Nucleic Acid, 
Polypeptides and Uses 

TITLE OF INVENTION: Thereof 
FILE REFERENCE: M07218 

CURRENT APPLICATION NUMBER: US/10/241,784 
CURRENT FILING DATE: 2002-09-11 
NUMBER OF SEQ ID NOS: 2 
SOFTWARE: PatentIn version 3.1 
SEQ ID NO 1 
LENGTH: 1833 
TYPE: DNA 

ORGANISM: Drosophila melanogaster 
FEATURE: 
NAME/KEY: CDS 
LOCATION: (1)..(1833) 
OTHER INFORMATION: 
US-10-241-784-1 

Query Match 21.6%; Score 376.6; DB 13; Length 1833; 

Best Local Similarity 56.0%; Pred. No. 9.9e-103; 

Matches 868; Conservative 0; Mismatches 624; Indels 57; Gaps 



8 TCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTGGTTGGAA 67 

II III 1 1 1 I I M 1 1 II ii_[i_iL„li„LiiliLi 

5 TCAATA' 
68 



.TCGCTGGCGTGGTGAGCATCGTGCTCTTCTACCTCCTGATCCTGGTCGTTGGCA 64 



TATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAAGCCATCA 127 

I I I I I I I III 1 II I II I M I III 

65 TTTGGGCCGGTCGCAAGAAGCAGTCCGGCAATGATTCGGAGGAG GAGGTCA 115 

128 TAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCTACCTGGG 187 

. I II II II II II I II II II II iiMMM II mini 

116 TGCTGGCCGGACGCTCCATCGGCCTCTTCGTGGGCATCTTCACCATGACGGCCACCTGGG 175 
188 TCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTATGGCCTAG 247 

II II M II 1 1 1 1 1 nil' I ' 'I I i.l!_„!„Liii 

176 



TGGGTGGCGGCTACATCAACGGCACGGCGGAGGCTATATACACATCGGGT CTGG 229 



248 CTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTGTTCTTTG 

111 Mill ill lllllll MM II M I M IMMI I IIMIII 

230 TGTGGTGCCAGGCTCCATTTGGATACGCTCTAAGCTTGGTATTTGGTGGCATCTTCTTTG 
308 CAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAAATCTATG 

I II II I I 11 I II I I I I I I I I M I I I II I 11 I I I I I 

290 CCAATCCCATGCGCAAGCAGGGTTACATCACCATGTTGGATCCGTTGCAGGATTCCTTTG 

368 GAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTCTGGGCTG 

I I I I I I I I 1 11 I I I I I I I I I 1! 1 I I I I 1 1 I 1 I I 1 I I I 
350 GTGAGCGGATGGGAGGATTGCTCTTCCTGCCCGCTCTATGCGGTGAGGTCTTTTGGGCAG 

428 CAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGATATGCACA 

II II I I I I I II I II I I I I 1 I I I I I I I I I IN 

410 CCGGCATCCTGGCTGCACTTGGCGCCACTCTATCGGTGATCATCGACATGGATCACCGCA 

488 TTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGGCTCTATT 

I I I I I I 1 1 I I I lllllll 1 I I I 1 I 1 1 I I I I I I I I I I I I 
470 CCTCGGTGATCCTGTCCTCCTGCATCGCCATCTTCTACACACTGTTCGGTGGACTGTACT 

548 CTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGGATCAGCG 

I I I I I 1 II 1 1 I I I I I I 1 1 I 1 I 1 1 I I I 1 I I I 1 I I I M I M M 
530 CCGTGGCGTATACGGACGTGATCCAGTTGTTCTGCATCTTCATCGGTCTGTGGATGTGCA 



307 

289 

367 

349 

427 

409 

487 

469 

547 

529 

607 

589 



608 TCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTGCATGCCA 667 



I Mill II I I 1 111 1 I I I I 

590 TTCCCTTCO 



CCTG6AGCAACGAGCACGTGGGCAGCCTGAGTGACCTGGAGGTGGAT 645 



AATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGGCTTGATA 727 



668 .r-Lx^kj-a v'-*^^ " - - — >^^^ ,111 

I I II 11 II I I I L.._i_i! 

646 



TGGATTGGGCACGTGGAGCCTAAAAAGCATTGGCTGTACATAGACT 691 



728 GTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGGGTTCTCT 787 



II II I I I Mill II II I I 1 I LLilill_i!i„ii 
692 ACGGCTTGCTG 



CTCGTCTTTGGTGGCATTCCCTGGCAGGTCTACTTCCAGCGGCAAAAC- 750 



788 CTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGGTGCCTGG 847 



III II III Ml I I II I I lllllll 

751 GGC 



lAGGAAGGGCCCAGCTTCTGCCTATGTTGCAGCCGCCGGATGCATTT 799 



848 TGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGGAACCAGA 



I I I I I I I I I I I I I I I I I I I I I I I I M I INI M I I I I I I I 

Db 800 TGATGGCCATTCCCCCGGTGCTCATCGGAGCGATTGCCAAGGCTACACCTTGGAACGAGA 



Qy 908 



CTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTACCAATTG 

MM I II Mill III II li I >l I 

Db 860 CAGATTACAAGGGACCCTATCCCCTGACCGTGGACGAGACGAGCATGATTCTGCCCATGG 

TTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTTTCTGCTG 

I iiilllll III INI I I N MIIIIN I II II INN N < 

TGCTGCAGTACCTCACGCCTGACTTCGTGTCCTTCTTTGGATTGGGCGCTGTTTCCGCCG 



Qy 958 
Db 920 



Qy 

Db 



Qy 1028 CTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCACGGAACA 

I II mil M M II II II I II I I I INN 

CCGTGATGTCCTCCGCCGACTCCTCGGTGCTCTCCGCCGCCTCCATGTTCGCTCGGAACG 



Db 980 
QY 



859 

967 

919 

1027 

979 

1087 

1039 



1088 TCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTTATGCGAA 1147 

I III I I III I II II M II II I mill INNII MINI 
Db 1040 TGTACAAATTGATTTTCCGTCAGAAGGCGTCCGAGATGGAAATCATTTGGGTGATGCGAG 1099 

Qy 1148 TCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAAACTGTGT 1207 

n I 1 mil I N Nil I III III I I 

Db 1100 TCGCCATCATTGTGGTGGGCATCCTGGCTACCATCATGGCCCTCACCATTCCCTCCATCT 1159 

Qy 1208 ATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAGCTGCTT- 1266 

III I MM II I II II M II NMI II 1 INIl mil N 

Db 1160 ACGGTTTGTGGTCCATGTGCTCGGATCTGGTCTACGTCATTCTGTTCCCGCAGCTACTGA 1219 

1267 --TGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTTTCTG 1324 

MINI IN I III I 1 N II I I N I 

1220 TGGTGGTGCACTTCAAGAAGCACTGCAACACGTACGGCAGCCTGTCGGCATACATTGTGG 1279 



1384 



Qy 1325 GCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATCTTCT 

Ml I I 11 I I II N III I II II N Nil II I 

Db 1280 CCCTGGCCATCCGACTGTCGGGCGGTGAGGCCATCTTGGGACTGGCTCCATTGATCAAGT 1339 

Qy 1385 ACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAATTTCCATTTAAAACAC 1444 

I II II III n II II I I III I I I II I I N 

Db 1340 ATCCCGGCTACGACGAGGAGACCAAGG AGCAGATGTTCCCCTTCCGCACCA 1390 

Qy 1445 TTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTATCTAGCCAAGTATCTAT 1504 

I Ml 1 II I 1111 INI NUN III ' ' 

Db 1391 TGGCCATGCTGCTCAGCCTGGTCACGCTCATCTCGGTCTCCTGGTGGACTAAAATGATGT 1450 

Qy 1505 TTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCTGTTGT 1553 

III I III I II I I II I I N N INN 
Db 1451 TTGAGTCCGGCAAGTTGCCGCCCAGCTACGACTACTTCCGCTGTGTGGT 1499 



RESULT 9 

US-09-911-077A-7 

; Sequence 7, Application US/09911077A 

; Publication No. US20030114399A1 

; GENERAL INFORMATION: 

; APPLICANT: BLAKELY, RANDY D. 

; APPLICANT: APPARSUNDARAM, SUBRAMANIAM 

; APPLICANT: FERGUSON, SHAWN 

; TITLE OF INVENTION: HUMAN AND MOUSE CHOLINE TRANSPORTER cDNA 



FILE REFERENCE: VBLT:008US 

CURRENT APPLICATION NUMBER: US/09/911,077A 
CURRENT FILING DATE: 2001-07-23 
NUMBER OF SEQ ID NOS: 27 
SOFTWARE: PatentIn Ver. 2.1 
SEQ ID NO 7 
LENGTH: 1985 
TYPE: DNA 

ORGANISM: Caenorhabditis elegans 
FEATURE: 
NAME/KEY: CDS 
LOCATION: (9)..(1739) 

US-09-911-077A-7 

Query Match 20.9%; Score 363.8; DB 10; Length 1985; 

Best Local Similarity 55.1%; Pred. No. 8.1e-99; 

Matches 862; Conservative 0; Mismatches 637; Indels 66; Gaps ^; 

Qy 19 GGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTGGTTGGAATATGGGCTGCC 78 

^ II I I II II I I II II II I II Ml I I IIIMIIIIIIMI 1 

24 GGTATCGTGGCCATTGTGTTCTTCTACGTGCTCATTCTTGTCGTTGGAATATGGGCGGGT 



Qy 



83 



79 TGGAGAACCAAAA ACAGTGGCAGCGCAGAAGAGCGCAGCGAAGCCATC 126 

I I II MM 11 I II I MM I 

Db 84 AGAAAATCGAAAAGTTCAAAAGAGCTTGAATCAGAAGCCGGCGCGGCGACGGAAGAGGTG 



Qy 127 
Db 144 



ATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCTACCTGG 

II I I Ml II I II M I II II II M Mill II II Ml 

ATGTTAGCTGGGAGAAACATCGGAACTCTTGTCGGAATTTTCACAATGACTGCCACGTGG 



Qy 187 



GTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAGGTTATGGCCTA 

II II II I MIIMIMM I I IM MM M 

Db 204 GT 



143 
186 
203 

246 



TGGCGGTGCTTATATCAATGGAACCGCCGAGGCTCTGTATAATGGA^ CTC 257 



Qy 247 GCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGTGGCCTGTTCTTT 

^ II lIMM Ml IMMIII I II M II I II II II Ml 

Db 258 CTTGGATGTCAGGCTCCAGTTGGATATGCAATTTCCCTTGTTATGGGAGGACTACTTTTC 

Qy 307 GCAAAACCTATGCGTTCAAAGGGGTATGTGACCATGTTAGACCCGTTTCAGCAAATCTAT 

^ Mill Mill I I II III I II Ml I M II MM MM I Ml 

GCAAAGAAAATGCGAGAAGAAGGATATATTACAATGCTCGATCCTTTTCAGCACAAATAT 



Db 318 

Qy 367 

Db 378 

Qy 427 



Db 438 
Qy 487 



306 
317 
366 
377 
426 
437 
486 
497 
546 
557 
606 

GCAGTCGCGTACACTGACGTCGTTCAACTATTTTGCATTTTCGTCGGTTTGTGGGTTTGC 617 



GGAAAACGCATGGGCGGACTCCTGTTTATTCCTGCACTGATGGGAGAAATGTTCTGGGCT 

II MM II M II I III I MM Mill I II MM MUM I 

GGCCAACGAATCGGTGGCTTGATGTATGTTCCAGCACTTCTTGGTGAAACATTCTGGACA 

GCAGCAATTTTCTCTGCTTTGGGAGCCACCATCAGCGTGATCATCGATGTGGATATGCAC 

Mill III I II II I II II II I Ml I Mil 

GCAGCCATTCTTTCGGCACTTGGTGCAACACTGTCGGTAATTCTTGGAATCGACATGAAT 



ATTTCTGTCATCATCTCTGCACTCATTGCCACTCTGTACACACTGGTGGGAGGGCTCTAT 

M II I I I M M I II II I IMMIII MM MM 

Db 498 GCATCAGTGACCCTGTCGGCCTGTATTGCCGTATTCTACACATTCACCGGTGGATACTAT 

Qy 547 TCTGTGGCCTACACTGATGTCGTTCAGCTCTTTTGCATTTTTGTAGGGCTGTGGATCAGC 

^ I II M Mllllll Mllllll II Mill I II II MMI L_il 

Db 558 



607 GTCCCCTTTGCATTGTCACATCCTGCAGTCGCAGACATCGGGTTCACTGCTGTGCATGCC 666 

II II II II 111 M I II II Mill 

618 GTGCCGGCGGCTATGGTGCATGATGGTGCGAAGGATATTTCCAGGAATGCAG- 669 

667 AAATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCTTGGCTTGAT 726 

I I I II I I Ml \_ 

670 



GCGACTGGATTGGAGAGATTGGAGGATTCAAAGAAACATCTCTCTGGATTGAT 722 



727 AGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGGGTTCTC 786 

I III I I I I I I I I I I I I I I I M N M I I I I I II I I II I I M 
723 TGCATGCTTCTCCTTGTCTTTGGAGGAATTCCATGGCAAGTGTACTTCCAAAGAGTTCTC 782 

787 TCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGGTGCCTG 846 

I I I I I I I MM I M I i II Mill I II I II I I 

783 TCCTCAAAAACTGCTCATGGAGCACAGACGTTGTCGTTTGTGGCGGGCGTCGGATGCATT 842 

847 GTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGGAACCAG 906 

I lllll II Ml I I II II II MM Ml I 

843 CTCATGGCGATTCCACCAGCGTTGATCGGTGCAATTGCCAGGAACACAGACTGGAGAATG 902 



907 ACTGCATATGGGCTTCC AGATCCCAAGACTACAGAAGAGGCA 948 

MM III I 11 I I I 1 II M I 

903 ACTG/ 



SATTATTCCCCATGGAACAATGGAACTAAGGTCGAATCGATTCCACCGGATAAGAGA 962 



949 GACATGATTTTACCAATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGT 1008 

Mill I ill III I MM MM II I I I 

963 AACATGGTGGTCCCGTTGGTATTCCAGTATCTTACGCCAAGATGGGTCGCCTTTATTGGA 



1009 CTTGGTGCAGTTTCTGCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGT 

M II 11 I II II II M I I M II M II II I M II II I II I I I I M I I 

1023 CTCGGCGCAGTGTCGGCTGCTGTAATGTCATCTGCAGATTCATCTGTACTATCAGCAGCA 



1022 
1068 
1082 
1128 



1069 TCCATGTTTGCACGGAACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAA 

M MM MM MM I I I I I II M M MUM 

1083 TCAATGTTTGCTCACAACATCTGGAAGCTCACAATTCGCCCTCACGCGTCTGAAAAAGAA 1142 
1129 ATCGTTTGGGTTATGCGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCC 1188 

I I I I I M II II I I I MM II I 11 MINI 

1143 GTGATAATTGTGATGAGAATAGCCATCATCTGTGTTGGTATCATGGCAACCATCATGGCA 1202 



1189 
1203 
1249 



TTGCTGACGAAAACTGTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTT 

I I II I I II II M II II I M M M I M 1 II I I I I I I 
CTTACCATTCAATCCATCTATGGGCTTTGGTATCTTTGTGCAGATTTGGTCTACGTCATA 



ATCTTCCCCCAGCTGCTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTG 

MMIll M II I lllll I I I 1 1 I Ml II Mill I I I 

1263 CTCTTCCCTCAACTATTATGTGTTGTATATATGCCACGTAGCAATACGTATGGCTCATTG 



1309 GCAGGTTATGTTTCTGGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTT 

II M M II II II 111 I I I II I II 1 I I I I M 

1323 GCTGGCTATGCAGTCGGTCTTGTGCTCCGTTTGATTGGAGGCGAGCCACTTGTATCGCTG 



1248 
1262 
1308 
1322 
1368 
1382 
1428 



1369 CAGCCCTTGATCTTCTACCCTGGCTATTACCCTGATGATAATGGTATATATAATCAGAAA 

I Mill II II III III ' ' ' ' 

1383 CCAGCGTTCTTCCATTATCCAATGTATACGGATGGGG TACAGTAT 1427 



Qy 1429 TTTCCATTTAAAACACTTGCCATGGTTACATCATTCTTAACCAACATTTGCATCTCCTAT 1488 

I I I I I I I I III I II I I I I I I I I I I I I I I I I 

Db 1428 TTCCCATTCAGGACAACTGCTATGTTATCTTCAATGGCTACTATCTACATTGTATCAATA 1487 

Qy 1489 CTAGCCAAGTATCTATTTGAAAGTGGAACCTTGCCACCTAAATTAGATGTATTTGATGCT 1548 

III I I I I I I I II III II I I I I II II I M I I I I 
Db 1488 CAATCGGAGAAGCTGTTCAAATCGGGACGTTTGTCTCCGGAGTGGGACGTAATGGGTTGT 1547 

Qy 1549 GTTGT 1553 

I I II 

Db 1548 GTAGT 1552 

RESULT 10 
US-09-974-300-501 

Sequence 501, Application US/09974300 
Patent No. US20020146721A1 
GENERAL INFORMATION: 
APPLICANT: Berka, Randy M. 
APPLICANT: Clausen, Ib Groth 

TITLE OF INVENTION: Methods For Monitoring Multiple Gene 
TITLE OF INVENTION: Expression 
FILE REFERENCE: 10085.500-US 
CURRENT APPLICATION NUMBER: US/09/974,300 
CURRENT FILING DATE: 2001-10-05 
PRIOR APPLICATION NUMBER: 09/680,598 
PRIOR FILING DATE: 2000-10-06 
PRIOR APPLICATION NUMBER: 60/279,526 
PRIOR FILING DATE: 2001-03-27 
NUMBER OF SEQ ID NOS : 8481 

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 501 
LENGTH: 1461 
TYPE: DNA 

ORGANISM: Bacillus licheniformis 
US-09-974-300-501 

Query Match 13.9%; Score 242.6; DB 9; Length 1461; 

Best Local Similarity 52.4%; Pred. No. 4.3e-62; 

Matches 663; Conservative 0; Mismatches 554; Indels 48; Gaps 4; 

TTATTGGTTGGTGGATTTACCATGACAGCTACCTGGGTCGGAGGAGGGTATATCAATGGC 210 

II II II II I II M I I II II I I II II II II II Mill II II 
TTTTTCGTCGGAATGGTGACGATGGCCGCAACATGGGTCGGCGGCGGATATATTAACGGA 184 

ACAGCTGAAGCAGTTTATGTACCAGGTTATGGCCTAGCTTGGGCTCAGGCACCAATTGGA 270 

I I I I I I I I III I I M I M II I I I I I I I I I II 
ACGGCCGAATCGACTTACA GCGACGGCCTCATCTGGGCCCAAGCGCCTTGGGGC 238 

TATTCTCTTAGTCTGATTTTAGGTGGCCTGTTCTTTGCAAAACCTATGCGTTCAAAGGGG 330 

II I I M I II I I I I II II I M I M I I I I I I I I I I 
TACGCATTGAGCCTGATTATCGGCGGTATTTTCTTCGCCAGAAAAATGCGCCGTCATCAA 298 

TATGTGACCATGTTAGACCCGTTTCAGCAAATCTATGGAAAACGCATGGGCGGACTCCTG 390 
I I I I I II I I I I I I I I III M I I I I I I I I II I I I I I 



Qy 


151 


Db 


125 


Qy 


211 


Db 


185 


Qy 


271 


Db 


239 


Qy 


331 


Db 


299 



391 TTTATTCCTGCACTGATGGGAGAAATGTTCTGGGCTGCAGCAATTTTCTCTGCTTTGGGA 450 

I IN II II III I I nil Ml _ ILiiJLlL-LiLliiiL 



I III II II III I I I M I I I I I I III II II • I ' > ' ' ' 

TATATACCGGCGCTGTTAGGAGAATTGTTTTGGAGCGCCGCGATCTTAACGGCATTGGGC 



359 

451 GCCACCATCAGCGTGATCATCGATGTGGATATGCACATTTCTGTCATCATCTCTGCACTC 510 

111 III 11 I I I M I I I I I I I I II IN I I I I I I 

419 ACGACTTTCGGAATGATTCTGAATATCGATTTCCAAACGTCGATTATTCTTTCGGCGATG 478 
511 ATTGCCACTCTGTACACACTGGTGGGAGGGCTCTATTCTGTGGCCTACACTGATGTCGTT 570 

II Mil II II III IIIM II 1 M II 1 III Mini M 

479 ATCGCCATCGCTTATACGGTGGCCGGAGGCATGTGGGCAGTTGCTTTCACAGATGTCTTT 538 

571 CAGCTCTTTTGCATTTTTGTAGGGCTGTGGATCAGCGTCCCCTTTGCATTGTCACATCCT 630 

M I M mill I Mil Ml I MM! I II Ml II 

539 CAAATGATTGTCATTTTGCTCGGGCTGTTTTTAGTCGTCCCATTTGTATTGTCGAATGTC 598 

631 GCAG TCGCAGACATCGGGTTCACTGCTGTGCATGCCAA 668 

II i III II I I II I II I 

599 GGTGCTCTCGATAGCGTATGGGCAAATTACAGGCACGATTTCGGCAGCAGCGCCAATCTG 658 

669 ATACCAAAAGCCGTGGCTGGGAACTGTTGACTCATCTGAAGTCTACTCT TGGCTT 723 

111 II MM M I M I I Ml 

659 CTTCCGCCGTTGGACGGCTGGAAAAACCCGGATTGGGGAAACCTGTTTTGGAACTGGTGG 718 

724 GATAGTTTTCTGTTGTTGATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGGGTT 783 

Mill II I M II I M II 111 I MM! I I IMIIIIII I ill 
719 GATAATGCGCTCCTCTTAATTTTCGGAGGTATCGCATGGCAGGTGTACTTTCAGCGCGTT 778 

784 CTCTCTTCTTCCTCAGCCACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGGTGC 843 

I I I I I 1 I I I I M I I II I I I M I I III 

779 CTTTCGGCAAAATCGGAAAGCGCCGCCATGTGGCAGTCGATAATTGCCGGAGTGATCTGC 838 

844 CTGGTGATGGCCATCCCAGCCATACTCATTGGGGCCATTGGAGCATCAACAGACTGGAAC 903 

1 I I M II I I I M 1 I M II M III " " " " ' „o 

839 ATCATTGCCGCCATTCCGTGCGTAATCATCGGAGCTGCCGGAAACAGTACCGATTGGAGC 898 

904 CAGACTGCATATGGGCTTCCAGATCCCAAGACTACAGAAGAGGCAGACATGATTTTACCA 963 

II II I I II M I I I I II II I I M 11 

899 CTGTTCGGAGCGAGCGCTCCGGATAACCCGGCG ATGATTTTGCCG 943 

964 ATTGTTCTGCAGTATCTCTGCCCTGTGTATATTTCTTTCTTTGGTCTTGGTGCAGTTTCT 1023 

II II II I 11 I 11 I I I M I II M 1 I I I 

944 CAAACGCTTGCGTATTTGACGCCAGGAATCATCGCAGGCCTCGGCTTGGGTGCAATCGCA 1003 



1024 GCTGCTGTTATGTCATCAGCAGATTCTTCCATCTTGTCAGCAAGTTCCATGTTTGCACGG 

,1111111 I I I I I I Mill I I I II 



1083 



I I I I I 1 I 1 I I M II II 11 1 I I M I II Mill M I M 

1004 GCAGCCGTCATGTCAAGCATGGACTCATCGATTCTATCGGCATCATCAATGGCCGCATGG 1063 

1084 AACATCTACCAGCTTTCCTTCAGACAAAATGCTTCGGACAAAGAAATCGTTTGGGTTATG 1143 

I I II M II I MM I 11 I I I 1 I M I I M 1 

1064 AATATTTACCGTCCGCTCATCAAGCCGAAGGCCACCCAAAAACAGCTGCAAAAAGTCGTC 1123 

1144 CGAATCACAGTGTTTGTGTTTGGAGCATCTGCAACAGCCATGGCCTTGCTGACGAAAACT 1203 

I I II I II II I I I I M II I I I M I II II MM 

1124 AAACGCTCAATCATTTTGTTCGGCGCGGGAGCAGCGGTCATCGCGCTGAATGTCAAAAGC 1183 
1204 GTGTATGGGCTCTGGTACCTCAGTTCTGACCTTGTTTACATCGTTATCTTCCCCCAGCTG 1263 



1 I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I 

Db 1184 GTTTATACTTTATGGTATTTGGCTTCGGATTTAGTTTATTGCATTCTTTTTCCCCAGTTA 1243 

Qy 1264 CTTTGTGTACTCTTTGTTAAGGGAACCAACACCTATGGGGCCGTGGCAGGTTATGTTTCT 1323 



I mill Ml 'I I II iL[iLi„..LiLiLLiL..J 

Db 1244 



ACAATGGCCCTCTTTTATAAAAGAGCAAATCTTTACGGGTCGATTGCTGGATTTGCAGTT 1303 



Qy 1324 GGCCTCTTCCTGAGAATAACTGGAGGGGAGCCATATCTGTATCTTCAGCCCTTGATCTTC 1383 

I II I Mill I II II II II I III III I 1 

Db 1304 GCAGTCATTCTGAGGCTCGGCGGTGGTGAACCCGCATTCGGCATTCCGCCGCTTCTGCCG 1363 

Qy 1384 TACCC 1388 

II I I 

Db 1364 TATCC 1368 



RESULT 11 

US-09-911-077A-19/C 

Sequence 19, Application US/09911077A 
Publication No. US20030114399A1 
GENERAL INFORMATION: 
APPLICANT: BLAKELY, RANDY D. 
APPLICANT: APPARSUNDARAM, SUBRAMANIAM 
APPLICANT: FERGUSON, SHAWN 

TITLE OF INVENTION: HUMAN AND MOUSE CHOLINE TRANSPORTER cDNA 
FILE REFERENCE: VBLT:008US 

CURRENT APPLICATION NUMBER: US/09/911,077A 
CURRENT FILING DATE: 2001-07-23 
NUMBER OF SEQ ID NOS: 27 
SOFTWARE: PatentIn Ver. 2.1 
SEQ ID NO 19 

LENGTH: 119040 
TYPE: DNA 

ORGANISM: Homo sapiens 
FEATURE: 

NAME/KEY: modified_base 
LOCATION: (2347)..(90873) 
OTHER INFORMATION: N = A, C, G or T/U 
US-09-911-077A-19 

Query Match 10.4%; Score 180.8; DB 10; Length 119040; 

Best Local Similarity 98.9%; Pred. No. 6.3e-42; 

Matches 182; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 



Qy     1 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60 
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 94584 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 94525 

Qy    61 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 120 
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 94524 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 94465 

Qy   121 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 180 
         |||||||||||||||||||||||||||||||||||||||||||||||||||||||||| |
Db 94464 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGGT 94405 

Qy   181 ACCT 184 
         || |
Db 94404 ACGT 94401 



RESULT 12 

US-09-911-077A-14/C 

Sequence 14, Application US/09911077A 
Publication No. US20030114399A1 
GENERAL INFORMATION: 
APPLICANT: BLAKELY, RANDY D. 
APPLICANT: APPARSUNDARAM, SUBRAMANIAM 
APPLICANT: FERGUSON, SHAWN 

TITLE OF INVENTION: HUMAN AND MOUSE CHOLINE TRANSPORTER cDNA 
FILE REFERENCE: VBLT:008US 

CURRENT APPLICATION NUMBER: US/09/911,077A 
CURRENT FILING DATE: 2001-07-23 
NUMBER OF SEQ ID NOS: 27 
SOFTWARE: PatentIn Ver. 2.1 
SEQ ID NO 14 

LENGTH: 142299 
TYPE: DNA 

ORGANISM: Artificial Sequence 

FEATURE: 

OTHER INFORMATION: Description of Artificial Sequence: Synthetic 

OTHER INFORMATION: Primer 

FEATURE: 

NAME/KEY: modified_base 
LOCATION: (1305)..(127835) 
OTHER INFORMATION: N = A, C, G or T/U 
US-09-911-077A-14 

Query Match 10.4%; Score 180.8; DB 10; Length 142299; 

Best Local Similarity 98.9%; Pred. No. 7.2e-42; 

Matches 182; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 

Qy     1 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 60 
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 64222 ATGGCTTTCCATGTGGAAGGACTGATAGCTATCATCGTGTTCTACCTTCTAATTTTGCTG 64163 

Qy    61 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 120 
         ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 64162 GTTGGAATATGGGCTGCCTGGAGAACCAAAAACAGTGGCAGCGCAGAAGAGCGCAGCGAA 64103 

Qy   121 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGCT 180 
         |||||||||||||||||||||||||||||||||||||||||||||||||||||||||| |
Db 64102 GCCATCATAGTTGGTGGCCGAGATATTGGTTTATTGGTTGGTGGATTTACCATGACAGGT 64043 

Qy   181 ACCT 184 

Db 64042 



RESULT 13 

US-09-864-761-1838 

; Sequence 1838, Application US/09864761 
; Patent No. US20020048763A1 
; GENERAL INFORMATION: 



; APPLICANT: Penn, Sharron G. 
; APPLICANT: Rank, David R. 
; APPLICANT: Hanzel, David K. 
; APPLICANT: Chen, Wensheng 

; TITLE OF INVENTION: HUMAN GENOME-DERIVED SINGLE EXON NUCLEIC ACID PROBES USEFUL FOR 

; TITLE OF INVENTION: GENE EXPRESSION ANALYSIS BY MICROARRAY 
; FILE REFERENCE: Aeomica-X-1 

; CURRENT APPLICATION NUMBER: US/09/864,761 

; CURRENT FILING DATE: 2001-05-23 

; PRIOR APPLICATION NUMBER: US 60/180,312 

; PRIOR FILING DATE: 2000-02-04 

; PRIOR APPLICATION NUMBER: US 60/207,456 

; PRIOR FILING DATE: 2000-05-26 

PRIOR APPLICATION NUMBER: US 09/632,366 
; PRIOR FILING DATE: 2000-08-03 

PRIOR APPLICATION NUMBER: GB 24263.6 
; PRIOR FILING DATE: 2000-10-04 
; PRIOR APPLICATION NUMBER: US 60/236,359 
; PRIOR FILING DATE: 2000-09-27 
; PRIOR APPLICATION NUMBER: PCT/US01/00666 
; PRIOR FILING DATE: 2001-01-30 
; PRIOR APPLICATION NUMBER: PCT/US01/00667 
; PRIOR FILING DATE: 2001-01-30 
; PRIOR APPLICATION NUMBER: PCT/US01/00664 
; PRIOR FILING DATE: 2001-01-30 
; PRIOR APPLICATION NUMBER: PCT/US01/00669 
; PRIOR FILING DATE: 2001-01-30 
; PRIOR APPLICATION NUMBER: PCT/US01/00665 
; PRIOR FILING DATE: 2001-01-30 
; PRIOR APPLICATION NUMBER: PCT/US01/00668 
; PRIOR FILING DATE: 2001-01-30 
; PRIOR APPLICATION NUMBER: PCT/US01/00663 
; PRIOR FILING DATE: 2001-01-30 
; PRIOR APPLICATION NUMBER: PCT/US01/00662 
; PRIOR FILING DATE: 2001-01-30 
; PRIOR APPLICATION NUMBER: PCT/US01/00661 
; PRIOR FILING DATE: 2001-01-30 
; PRIOR APPLICATION NUMBER: PCT/US01/00670 

; PRIOR FILING DATE: 2001-01-30 

; PRIOR APPLICATION NUMBER: US 60/234,687 

; PRIOR FILING DATE: 2000-09-21 

; PRIOR APPLICATION NUMBER: US 09/608,408 

; PRIOR FILING DATE: 2000-06-30 

; PRIOR APPLICATION NUMBER: US 09/774,203 

; PRIOR FILING DATE: 2001-01-29 

; NUMBER OF SEQ ID NOS : 49117 

SOFTWARE: Annomax Sequence Listing Engine vers. 1.1 

; SEQ ID NO 1838 

; LENGTH: 455 
TYPE: DNA 

; ORGANISM: Homo sapiens 

; FEATURE : 

; OTHER INFORMATION: MAP TO AC009963.2 

; OTHER INFORMATION: EXPRESSED IN FETAL LIVER, SIGNAL = 1.2 
; OTHER INFORMATION: EXPRESSED IN ADULT LIVER, SIGNAL = 1.1 
; OTHER INFORMATION: EXPRESSED IN HELA, SIGNAL = 1.2 
; OTHER INFORMATION: EXPRESSED IN PLACENTA, SIGNAL = 1.3 
; OTHER INFORMATION: EXPRESSED IN HBL100, SIGNAL = 0.97 
; OTHER INFORMATION: EXPRESSED IN HEART, SIGNAL = 1.2 
; OTHER INFORMATION: EXPRESSED IN BRAIN, SIGNAL = 1.2 
; OTHER INFORMATION: EXPRESSED IN BONE MARROW, SIGNAL = 1.3 
US-09-864-761-1838 

Query Match 8.9%; Score 155; DB 9; Length 455; 

Best Local Similarity 100.0%; Pred. No. 7.5e-36; 

Matches 155; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 741 GATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGGGTTCTCTCTTCTTCCTCAGC 800 
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 266 GATGCTGGGTGGAATCCCATGGCAAGCATACTTTCAGAGGGTTCTCTCTTCTTCCTCAGC 325 

Qy 801 CACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGGTGCCTGGTGATGGCCATCCC 860 
       ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 326 CACCTATGCTCAAGTGCTGTCCTTCCTGGCAGCTTTCGGGTGCCTGGTGATGGCCATCCC 385 

Qy 861 AGCCATACTCATTGGGGCCATTGGAGCATCAACAG 895 
       |||||||||||||||||||||||||||||||||||
Db 386 AGCCATACTCATTGGGGCCATTGGAGCATCAACAG 420 



RESULT 14 

US-10-027-632-120553/C 

Sequence 120553, Application US/10027632 
Publication No. US20020198371A1 
GENERAL INFORMATION: 
APPLICANT: Wang, David G. 

TITLE OF INVENTION: Identification and Mapping of Single Nucleotide 
TITLE OF INVENTION: Polymorphisms in the Human Genome 
FILE REFERENCE: 108827.129 

CURRENT APPLICATION NUMBER: US/10/027,632 
CURRENT FILING DATE: 2002-04-30 
PRIOR APPLICATION NUMBER: US 60/218,006 
PRIOR FILING DATE: 2000-07-12 
PRIOR APPLICATION NUMBER: US 60/198,676 
PRIOR FILING DATE: 2000-04-20 
PRIOR APPLICATION NUMBER: US 60/193,483 
PRIOR FILING DATE: 2000-03-29 
PRIOR APPLICATION NUMBER: US 60/185,218 
PRIOR FILING DATE: 2000-02-24 
PRIOR APPLICATION NUMBER: US 60/167,363 
PRIOR FILING DATE: 1999-11-23 
PRIOR APPLICATION NUMBER: US 60/156,358 
PRIOR FILING DATE: 1999-09-28 
PRIOR APPLICATION NUMBER: US 60/146,002 
PRIOR FILING DATE: 1999-08-09 
NUMBER OF SEQ ID NOS : 325720 
SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 120553 
LENGTH: 943 
TYPE : DNA 
ORGANISM: Human 
US-10-027-632-120553 



Query Match 6.8%; Score 118.6; DB 13; Length 943; 

Best Local Similarity 99.2%; Pred. No. 1.5e-24; 

Matches 118; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 
Qy 176 CAGCTACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAG 235 

MIIMIIIM IIIMIIIIIMMIIMIMIMMIIIIIIIMMM 

Db 589 CAGCTACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAG 530 

Qy 236 GTTATGGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGT 294 

IIIIIIIIIMIIIIIMIIMIMIMhlllllllMIIIIMMIIMIIIIIIM 

Db 529 GTTATGGCCTAGCTTGGGCTCAGGCACCARTTGGATATTCTCTTAGTCTGATTTTAGGT 471 



RESULT 15 

US-10-027-632-120553/C 

Sequence 120553, Application US/10027632 
Publication No. US20030204075A9 
GENERAL INFORMATION: 
APPLICANT: Wang, David G. 

TITLE OF INVENTION: Identification and Mapping of Single Nucleotide 
TITLE OF INVENTION: Polymorphisms in the Human Genome 
FILE REFERENCE: 108827.129 

CURRENT APPLICATION NUMBER: US/10/027,632 
CURRENT FILING DATE: 2002-04-30 
PRIOR APPLICATION NUMBER: US 60/218,006 
PRIOR FILING DATE: 2000-07-12 
PRIOR APPLICATION NUMBER: US 60/198,676 
PRIOR FILING DATE: 2000-04-20 
PRIOR APPLICATION NUMBER: US 60/193,483 
PRIOR FILING DATE: 2000-03-29 
PRIOR APPLICATION NUMBER: US 60/185,218 
PRIOR FILING DATE: 2000-02-24 
PRIOR APPLICATION NUMBER: US 60/167,363 
PRIOR FILING DATE: 1999-11-23 
PRIOR APPLICATION NUMBER: US 60/156,358 
PRIOR FILING DATE: 1999-09-28 
PRIOR APPLICATION NUMBER: US 60/146,002 
PRIOR FILING DATE: 1999-08-09 
NUMBER OF SEQ ID NOS : 325720 
SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 120553 
LENGTH: 943 
TYPE : DNA 
ORGANISM: Human 
US-10-027-632-120553 

Query Match 6.8%; Score 118.6; DB 16; Length 943; 

Best Local Similarity 99.2%; Pred. No. 1.5e-24; 

Matches 118; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 
Qy 176 CAGCTACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAG 235 

I I I I I I I I I I I I I I I I I I I M M M M I I I I I I I I I I I I I I I M I I I M I I I I I I I I I M 

Db 589 CAGCTACCTGGGTCGGAGGAGGGTATATCAATGGCACAGCTGAAGCAGTTTATGTACCAG 530 

Qy 236 GTTATGGCCTAGCTTGGGCTCAGGCACCAATTGGATATTCTCTTAGTCTGATTTTAGGT 294 

MMIIIIIIIMIIMMIIMIIIMIrlllMIIIIM MIMIIIIIIM 

Db 529 GTTATGGCCTAGCTTGGGCTCAGGCACCARTTGGATATTCTCTTAGTCTGATTTTAGGT 471 



Search completed: September 29, 2004, 05:05:29 
Job time : 1195 secs 



